Abstract

For delay-limited communication over block-fading channels, the difference between 
the ergodic capacity and the maximum achievable expected rate for coding over a finite 
number of coherent blocks represents a fundamental measure of the penalty incurred by 
the delay constraint. This paper introduces a notion of worst-case expected-capacity 
loss. Focusing on the slow-fading scenario (one-block delay), the worst-case additive and
multiplicative expected-capacity losses are precisely characterized for the point-to-point 
fading channel. Extension to the problem of writing on fading paper is also considered, 
where both the ergodic capacity and the additive expected-capacity loss over one-block 
delay are characterized to within one bit per channel use. 

1 Introduction 

Consider the discrete-time baseband representation of the single-user flat-fading channel:

$$Y[t] = \sqrt{G[t]}\,X[t] + Z[t] \tag{1}$$

where $\{X[t]\}$ are the channel inputs, which are subject to an average power constraint $P$;
$\{G[t]\}$ are the power gains of the channel fading, which we assume to be unknown to the
transmitter but known at the receiver; $\{Z[t]\}$ are the additive white circularly symmetric
complex Gaussian noise with zero mean and variance $N_0$; and $\{Y[t]\}$ are the channel outputs.
As is often done in the literature, we shall consider the so-called block-fading model [1], where
$\{G[t]\}$ are assumed to be constant within each coherent block and to change independently
across different blocks according to a known distribution $F_G(\cdot)$. The coherent time of the
channel is assumed to be large so that the additive noise can be "averaged out" within each
coherent block.

The focus of this paper is on delay-limited communication for which communication is 
only allowed to span (at most) a total of L coherent blocks where L is a finite integer. In 
this setting, the Shannon capacity is a very pessimistic measure as it is dictated by the worst 
realization of the power-gain process and hence equals zero when the realization of the power 
gain can be arbitrarily close to zero. An often-adopted measure in the literature is the expected 
capacity [2]-[5], which is defined as the maximum expected reliably decoded rate where the
expectation is over the distribution of the power-gain process. 

The problem of characterizing the expected capacity is closely related to the problem of 
broadcasting over linear Gaussian channels [2]-[5]. The case with $L = 1$ represents the most
stringent delay requirement, known as slow fading [1]. For slow-fading channels, the problem of
characterizing the expected capacity is equivalent to the problem of characterizing the capacity
region of a scalar Gaussian broadcast channel, which is well understood based on the classical
works of Cover [6] and Bergmans [7], and then finding an optimal rate allocation based on the
power-gain distribution. For $L > 1$, the expected capacity can be improved by treating each
realization of the power-gain process as a user in an $L$-parallel Gaussian broadcast channel and
coding the information bits across different sub-channels [8], [9]. In the limit as $L \to \infty$,
by the ergodicity of the power-gain process each "typical" realization of the power-gain process
can support a reliable rate of communication which is arbitrarily close to

$$C_{erg}(\mathrm{SNR}, F_G) = \mathbb{E}_G\left[\log(1 + G \cdot \mathrm{SNR})\right] \tag{2}$$

where $\mathrm{SNR} := P/N_0$ denotes the transmit signal-to-noise ratio. Thus,
$C_{erg}(\mathrm{SNR}, F_G)$ is both the Shannon capacity (appropriately known as the ergodic
capacity [1]) and the expected capacity in the limit as $L \to \infty$.

Formally, let us denote by $C_{exp}(\mathrm{SNR}, F_G, L)$ the expected capacity of the
block-fading channel (1) for which the transmit signal-to-noise ratio is $\mathrm{SNR}$, the
power-gain distribution is $F_G(\cdot)$, and communication is allowed to span (at most) a total of
$L$ coherent blocks. Then, as mentioned previously, the expected capacity
$C_{exp}(\mathrm{SNR}, F_G, L) \to C_{erg}(\mathrm{SNR}, F_G)$ in the limit as $L \to \infty$. As
such, the "gap" between the ergodic capacity $C_{erg}(\mathrm{SNR}, F_G)$ and the expected capacity
$C_{exp}(\mathrm{SNR}, F_G, L)$ represents a fundamental measure of the penalty incurred by imposing
a delay constraint of $L$ coherent blocks. Such gaps, naturally, depend on the operating transmit
signal-to-noise ratio and the underlying power-gain distribution. In this paper, we are interested
in characterizing the worst-case gaps over all possible transmit signal-to-noise ratios and all
possible power-gain distributions with a fixed number of different possible realizations of the
power gain in each coherent block.

More specifically, for the block-fading channel (1) with transmit signal-to-noise ratio
$\mathrm{SNR}$ and power-gain distribution $F_G(\cdot)$, let us define the additive and the
multiplicative gap between the ergodic capacity and the expected capacity under the delay
constraint of $L$ coherent blocks as

$$A(\mathrm{SNR}, F_G, L) := C_{erg}(\mathrm{SNR}, F_G) - C_{exp}(\mathrm{SNR}, F_G, L) \tag{3}$$

and

$$M(\mathrm{SNR}, F_G, L) := \frac{C_{erg}(\mathrm{SNR}, F_G)}{C_{exp}(\mathrm{SNR}, F_G, L)} \tag{4}$$

respectively. Focusing on the slow-fading scenario ($L = 1$), we have the following precise
characterization of the worst-case additive and multiplicative gaps between the ergodic capacity
and the expected capacity.

Theorem 1.

$$\sup_{\mathrm{SNR}, F_G} A(\mathrm{SNR}, F_G, 1) = \log K \tag{5}$$

and

$$\sup_{\mathrm{SNR}, F_G} M(\mathrm{SNR}, F_G, 1) = K \tag{6}$$

where the suprema are over all transmit signal-to-noise ratios $\mathrm{SNR} > 0$ and all
power-gain distributions $F_G(\cdot)$ with $K$ different possible realizations of the power gain in
each coherent block.

The above results have both positive and negative engineering implications, which we summarize
below.

• On the positive side, note that both the ergodic capacity $C_{erg}(\mathrm{SNR}, F_G)$ and the
expected capacity $C_{exp}(\mathrm{SNR}, F_G, 1)$ will generally grow unboundedly in the limit as
the transmit signal-to-noise ratio $\mathrm{SNR} \to \infty$. The difference between them, however,
will remain bounded for any finite-state fading channel (where $K$ is finite). Similarly, both the
ergodic capacity $C_{erg}(\mathrm{SNR}, F_G)$ and the expected capacity
$C_{exp}(\mathrm{SNR}, F_G, 1)$ will vanish in the limit as the transmit signal-to-noise ratio
$\mathrm{SNR} \to 0$. However, the expected capacity $C_{exp}(\mathrm{SNR}, F_G, 1)$ (under the
most stringent delay constraint of $L = 1$ coherent block) can account, at least, for a
non-vanishing fraction of the ergodic capacity $C_{erg}(\mathrm{SNR}, F_G)$.

• On the negative side, in the worst-case scenario both the additive gap $A(\mathrm{SNR}, F_G, 1)$
and the multiplicative gap $M(\mathrm{SNR}, F_G, 1)$ will grow unboundedly in the limit as the
number of different realizations of the power gain in each coherent block $K \to \infty$.
Therefore, when $K$ is large, delay-limited communication may incur a large expected-rate loss
relative to the ergodic scenario where there is no delay constraint on communication. For
continuous-fading channels where the sample space of $F_G(\cdot)$ is infinite and uncountable,
it is also possible that the expected-rate loss incurred by delay constraints is unbounded.

On the other hand, one should not be overly pessimistic when interpreting the worst-case gap
results (5) and (6). First, the above worst-case gap results are derived under the assumption that
the transmitter does not know the realization of the channel fading at all. In practice, however,
it is entirely possible that some information on the channel fading realization is made available
to the transmitter (via finite-rate feedback, for example). This information can potentially be
used to reduce the gap between the ergodic capacity and the expected capacity [10], [11]. Second,
for specific fading distributions the gap between the ergodic capacity and the expected capacity
can be much smaller. For example, it is known [1] that for Rayleigh fading, the additive gap
between the ergodic capacity and the expected capacity over one-block delay is only 1.649 nats per
channel use in the high signal-to-noise ratio limit, and the multiplicative gap is only 1.718 in
the low signal-to-noise ratio limit, even though in this case the power-gain distribution is
continuous.

The rest of the paper is organized as follows. In Sec. 2 we provide a proof of the worst-case gap
results (5) and (6) as stated in Theorem 1. Key to our proof is an explicit characterization of an
optimal power allocation for characterizing the expected capacity $C_{exp}(\mathrm{SNR}, F_G, 1)$,
obtained via the marginal utility functions introduced by Tse [9]. In Sec. 3 we extend our setting
from the point-to-point fading channel to the problem of writing on fading paper [15]-[17], and
provide a characterization of the ergodic capacity and the additive expected-capacity loss over
one-block delay to within one bit per channel use. Finally, in Sec. 4 we conclude the paper with
some remarks.

Note: In this paper, all logarithms are taken based on the natural number e. 



2 Proof of the Main Results 



2.1 Optimal Power Allocation via Marginal Utility Functions 

To prove the worst-case gap results (5) and (6) as stated in Theorem 1, let us fix the transmit
signal-to-noise ratio $\mathrm{SNR}$ and the power-gain distribution $F_G(\cdot)$ with $K$
different possible realizations of the power gain in each coherent block. Let $\{g_1, \ldots, g_K\}$
be the collection of the possible realizations of the power gain, and let
$p_k := \Pr(G = g_k) > 0$. Without loss of generality, let us assume that the possible realizations
of the power gain are ordered as

$$g_1 > g_2 > \cdots > g_K \ge 0. \tag{7}$$

With the above notation, the expected capacity $C_{exp}(\mathrm{SNR}, F_G, 1)$ (under the delay
constraint of $L = 1$ coherent block) is given by [1]

$$\max_{(\beta_1, \ldots, \beta_K)} \; \sum_{k=1}^{K} F_k \log\left(\frac{1 + \beta_k g_k \mathrm{SNR}}{1 + \beta_{k-1} g_k \mathrm{SNR}}\right) \quad \text{subject to} \quad 0 = \beta_0 \le \beta_1 \le \beta_2 \le \cdots \le \beta_K \le 1 \tag{8}$$

where

$$F_k := \sum_{j=1}^{k} p_j. \tag{9}$$

Note that the optimization program (8) with respect to the cumulative power fractions
$(\beta_1, \ldots, \beta_K)$ is not convex. However, the program can be convexified via the
following simple change of variable [12]

$$r_k := \log\left(\frac{1 + \beta_k g_k \mathrm{SNR}}{1 + \beta_{k-1} g_k \mathrm{SNR}}\right), \quad k = 1, \ldots, K. \tag{10}$$

In the preliminary version of this work [13], this avenue was further pursued to obtain an
implicit characterization of the optimal power allocation via the standard Karush-Kuhn-Tucker
conditions. Below we shall consider an alternative and more direct approach which provides
an explicit characterization of an optimal power allocation via the marginal utility functions
(MUFs) introduced by Tse [9].

Assume that $g_K > 0$ (which implies that $g_k > 0$ for all $k = 1, \ldots, K$), and let
$n_k := 1/g_k$ for $k = 1, \ldots, K$. Given the assumed ordering (7) for the power-gain
realizations $\{g_1, \ldots, g_K\}$, we have

$$0 < n_1 < \cdots < n_K. \tag{11}$$

Following [9], let us define the MUFs and the dominating MUF as

$$u_k(z) := \frac{F_k}{n_k + z}, \quad k = 1, \ldots, K \tag{12}$$

and

$$u^*(z) := \max_{k = 1, \ldots, K} u_k(z) \tag{13}$$

respectively. Note that for any $k = 1, \ldots, K$, $u_k(z) > 0$ if and only if $z > -n_k$. Also,
for any two distinct integers $k$ and $l$ such that $1 \le k < l \le K$, the MUFs $u_k(z)$ and
$u_l(z)$ have a unique intersection at $z = z_{k,l}$ where

$$z_{k,l} = \frac{F_k n_l - F_l n_k}{F_l - F_k} > -n_k. \tag{14}$$

Furthermore, we have $u_k(z) > u_l(z) > 0$ if and only if $-n_k < z < z_{k,l}$, and
$u_l(z) > u_k(z) > 0$ if and only if $z > z_{k,l}$ [9]. For the rest of the paper, the above
property will be frequently referred to as the single crossing point property of the MUFs (see
Fig. 1 for an illustration).
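
As a quick numerical illustration of the single crossing point property, the following sketch evaluates the MUFs (12) and the crossing point (14) for a hypothetical two-state channel. The function names and the example values (gains 4 and 1 with equal probabilities) are assumptions made for illustration only and do not come from the paper.

```python
def muf(F_k, n_k, z):
    """Marginal utility function u_k(z) = F_k / (n_k + z), as in (12)."""
    return F_k / (n_k + z)


def crossing(F, n, k, l):
    """Crossing point z_{k,l} of u_k and u_l, as in (14); k < l are 0-based."""
    return (F[k] * n[l] - F[l] * n[k]) / (F[l] - F[k])


# Hypothetical example: gains g = (4, 1), probabilities p = (1/2, 1/2).
n = [0.25, 1.0]  # n_k = 1 / g_k
F = [0.5, 1.0]   # F_k = p_1 + ... + p_k

z12 = crossing(F, n, 0, 1)
left = z12 - 0.1   # a point inside (-n_1, z_{1,2})
right = z12 + 0.1  # a point to the right of z_{1,2}

print("z_{1,2} =", z12)
print("u_1 > u_2 left of the crossing: ", muf(F[0], n[0], left) > muf(F[1], n[1], left))
print("u_2 > u_1 right of the crossing:", muf(F[1], n[1], right) > muf(F[0], n[0], right))
```

In this example the better state dominates only for small $z$, which is the behavior the dominating MUF construction below exploits.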

Next, let us use the single crossing point property of the MUFs to obtain an explicit
characterization of the dominating MUF, which will in turn lead to an explicit characterization
of an optimal power allocation. Let us begin by defining a sequence of integers
$\{\pi_1, \ldots, \pi_I\}$ recursively as follows.

Definition 1. First, let $\pi_1 = 1$. Then, define

$$\pi_{i+1} := \max\left[\arg\min_{l = \pi_i + 1, \ldots, K} z_{\pi_i, l}\right], \quad i = 1, \ldots, I - 1 \tag{15}$$

where $I$ is the total number of integers $\{\pi_i\}$ defined through the above recursive procedure.

Note that in the above definition, a "max" is used to break the ties for achieving the "min"
inside the brackets, so there is no ambiguity in defining the integer sequence
$\{\pi_1, \ldots, \pi_I\}$. Clearly, we have

$$1 = \pi_1 < \pi_2 < \cdots < \pi_I = K. \tag{16}$$
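
The recursion of Definition 1 can be sketched in a few lines. The code below is an illustrative sketch rather than the authors' implementation: the function names are hypothetical, indices are handled 0-based internally and returned 1-based, and ties are detected by exact floating-point equality (an assumption that is adequate for the exactly representable values used here). The first example is chosen to reproduce the ordering $z_{1,3} < z_{1,2} < z_{1,4}$ described in the caption of Fig. 2.

```python
def crossing(F, n, k, l):
    """Crossing point z_{k,l} from (14); k < l are 0-based indices."""
    return (F[k] * n[l] - F[l] * n[k]) / (F[l] - F[k])


def pi_sequence(F, n):
    """Integer sequence {pi_1, ..., pi_I} of Definition 1 (returned 1-based)."""
    K = len(n)
    pi = [0]  # pi_1 = 1, stored 0-based
    while pi[-1] < K - 1:
        k = pi[-1]
        z = {l: crossing(F, n, k, l) for l in range(k + 1, K)}
        z_min = min(z.values())
        # The "max" in (15) breaks ties among the minimizers.
        pi.append(max(l for l, v in z.items() if v == z_min))
    return [i + 1 for i in pi]


# Hypothetical K = 4 example with uniform probabilities.
F = [0.25, 0.5, 0.75, 1.0]
n = [1.0, 4.0, 5.0, 13.0]
print(pi_sequence(F, n))  # [1, 3, 4]

# Degenerate example: n_k / F_k is constant, so every crossing point is 0
# and the tie-break jumps directly to the last state.
print(pi_sequence([0.25, 0.5, 1.0], [1.0, 2.0, 4.0]))  # [1, 3]
```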

Furthermore, we have the following properties for the sequence
$\{z_{\pi_1,\pi_2}, z_{\pi_2,\pi_3}, \ldots, z_{\pi_{I-1},\pi_I}\}$, which are direct consequences
of the recursive definition (15) and the single crossing point property of the MUFs.

Lemma 1. 1) For any $i = 1, \ldots, I - 1$ and any $l = \pi_i + 1, \ldots, K$, we have

$$z_{\pi_i,\pi_{i+1}} \le z_{\pi_i,l}. \tag{17}$$

2) For any $i = 1, \ldots, I - 2$, we have

$$z_{\pi_i,\pi_{i+1}} < z_{\pi_{i+1},\pi_{i+2}}. \tag{18}$$

3) For any $i = 1, \ldots, I - 1$ and any $l = 1, \ldots, \pi_{i+1} - 1$, we have

$$z_{\pi_i,\pi_{i+1}} \ge z_{l,\pi_{i+1}}. \tag{19}$$

Figure 1: The single crossing point property between the MUFs $u_k(z)$ and $u_l(z)$ for $k < l$.

Proof. Property 1) follows directly from the recursive definition (15).

To prove property 2), let us consider proof by contradiction. Assume that
$z_{\pi_i,\pi_{i+1}} \ge z_{\pi_{i+1},\pi_{i+2}}$ for some $i \in \{1, \ldots, I - 2\}$. By property
1) and the tie-breaking rule in (15), we have
$z_{\pi_i,\pi_{i+2}} > z_{\pi_i,\pi_{i+1}} \ge z_{\pi_{i+1},\pi_{i+2}}$. Following the single
crossing point property, we have
$0 < u_{\pi_{i+1}}(z_{\pi_i,\pi_{i+2}}) \le u_{\pi_{i+2}}(z_{\pi_i,\pi_{i+2}}) = u_{\pi_i}(z_{\pi_i,\pi_{i+2}})$.
Using again the single crossing point property, we may conclude that
$-n_{\pi_i} < z_{\pi_i,\pi_{i+2}} \le z_{\pi_i,\pi_{i+1}}$. But this contradicts the fact that
$z_{\pi_i,\pi_{i+2}} > z_{\pi_i,\pi_{i+1}}$ as mentioned previously. This proves that for any
$i = 1, \ldots, I - 2$, we must have $z_{\pi_i,\pi_{i+1}} < z_{\pi_{i+1},\pi_{i+2}}$.

To prove property 3), let us fix $i \in \{1, \ldots, I - 1\}$. Note that the desired inequality
(19) holds trivially with equality for $l = \pi_i$, so we only need to consider the cases where
$l \in \{\pi_i + 1, \ldots, \pi_{i+1} - 1\}$ and $l \in \{1, \ldots, \pi_i - 1\}$.

For the case where $l \in \{\pi_i + 1, \ldots, \pi_{i+1} - 1\}$, by property 1) we have
$-n_{\pi_i} < z_{\pi_i,\pi_{i+1}} \le z_{\pi_i,l}$. Following the single crossing point property we
have $0 < u_l(z_{\pi_i,\pi_{i+1}}) \le u_{\pi_i}(z_{\pi_i,\pi_{i+1}}) = u_{\pi_{i+1}}(z_{\pi_i,\pi_{i+1}})$,
which in turn implies that $z_{\pi_i,\pi_{i+1}} \ge z_{l,\pi_{i+1}}$.

For the case where $l \in \{1, \ldots, \pi_i - 1\}$, let us assume, without loss of generality,
that $l \in \{\pi_m, \ldots, \pi_{m+1} - 1\}$ for some $m \in \{1, \ldots, i - 1\}$. By the previous
case we have $z_{\pi_m,\pi_{m+1}} \ge z_{l,\pi_{m+1}}$ and hence

$$0 < u_l(z) \le u_{\pi_{m+1}}(z), \quad \forall z \ge z_{\pi_m,\pi_{m+1}}. \tag{20}$$

Also note that

$$u_{\pi_{m+1}}(z) \le u_{\pi_{m+2}}(z) \le \cdots \le u_{\pi_{i+1}}(z), \quad \forall z \ge \max_{m+1 \le j \le i} z_{\pi_j,\pi_{j+1}}. \tag{21}$$

By property 2) we have

$$\max_{m+1 \le j \le i} z_{\pi_j,\pi_{j+1}} = z_{\pi_i,\pi_{i+1}} > z_{\pi_m,\pi_{m+1}}. \tag{22}$$

Combining (20)-(22) gives $0 < u_l(z_{\pi_i,\pi_{i+1}}) \le u_{\pi_{i+1}}(z_{\pi_i,\pi_{i+1}})$, which
in turn implies that $z_{\pi_i,\pi_{i+1}} \ge z_{l,\pi_{i+1}}$.

Combining the above two cases completes the proof of property 3) and hence the entire
lemma. □

Figure 2: An illustration of the dominating MUF. In this example, we have $K = 4$ and
$z_{1,3} < z_{1,2} < z_{1,4}$. Therefore, we have $I = 3$, $\pi_1 = 1$, $\pi_2 = 3$, and
$\pi_3 = 4$. The dominating MUF $u^*(z) = u_1(z)$ for $z \in (-n_1, z_{1,3})$, $u^*(z) = u_3(z)$ for
$z \in (z_{1,3}, z_{3,4})$, and $u^*(z) = u_4(z)$ for $z \in (z_{3,4}, \infty)$.

The following proposition provides an explicit characterization of the dominating MUF (see
Fig. 2 for an illustration).

Proposition 1 (Dominating marginal utility function). For any $i = 1, \ldots, I$ and any
$z \in (z_{\pi_{i-1},\pi_i}, z_{\pi_i,\pi_{i+1}}]$, the dominating MUF

$$u^*(z) = u_{\pi_i}(z) \tag{23}$$

where we define $z_{\pi_0,\pi_1} := -n_1$ and $z_{\pi_I,\pi_{I+1}} := \infty$ for notational
convenience (even though $\pi_0$ and $\pi_{I+1}$ will not be explicitly defined).

Proof. Fix $i \in \{1, \ldots, I\}$. Let us show that $u_{\pi_i}(z) \ge u_l(z)$ for any
$z \in (z_{\pi_{i-1},\pi_i}, z_{\pi_i,\pi_{i+1}}]$ by considering the cases $l > \pi_i$ and
$l < \pi_i$ separately.

For $l > \pi_i$, by the single crossing point property we have $0 < u_l(z) \le u_{\pi_i}(z)$ for
any $-n_{\pi_i} < z \le z_{\pi_i,l}$. By property 1) of Lemma 1, for any $l > \pi_i$ we have
$z_{\pi_i,\pi_{i+1}} \le z_{\pi_i,l}$. Combined with the fact that
$z_{\pi_{i-1},\pi_i} \ge -n_{\pi_i}$ (the equality holds only when $i = 1$ by the definition of
$z_{\pi_0,\pi_1}$ and the fact that $\pi_1 = 1$), we may conclude that for $l > \pi_i$,
$u_{\pi_i}(z) \ge u_l(z)$ for any $z \in (z_{\pi_{i-1},\pi_i}, z_{\pi_i,\pi_{i+1}}]$.

For $l < \pi_i$, by property 3) of Lemma 1 we have $z_{\pi_{i-1},\pi_i} \ge z_{l,\pi_i}$ and hence
$0 < u_l(z) \le u_{\pi_i}(z)$ for any $z \ge z_{\pi_{i-1},\pi_i}$.

Combining the above two cases completes the proof of the proposition. □

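Now, let $(\beta_1^*, \ldots, \beta_K^*)$ be an optimal solution to the optimization program (8).
Then, the expected capacity $C_{exp}(\mathrm{SNR}, F_G, 1)$ can be bounded from above using the
dominating MUF as follows:

$$C_{exp}(\mathrm{SNR}, F_G, 1) = \sum_{k=1}^{K} F_k \log\left(\frac{n_k + \beta_k^* \mathrm{SNR}}{n_k + \beta_{k-1}^* \mathrm{SNR}}\right) \tag{24}$$
$$= \sum_{k=1}^{K} \int_{\beta_{k-1}^* \mathrm{SNR}}^{\beta_k^* \mathrm{SNR}} u_k(z)\,dz \tag{25}$$
$$\le \sum_{k=1}^{K} \int_{\beta_{k-1}^* \mathrm{SNR}}^{\beta_k^* \mathrm{SNR}} u^*(z)\,dz \tag{26}$$
$$= \int_{\beta_0^* \mathrm{SNR}}^{\beta_K^* \mathrm{SNR}} u^*(z)\,dz \tag{27}$$
$$\le \int_{0}^{\mathrm{SNR}} u^*(z)\,dz \tag{28}$$

where (26) follows from the fact that for any $k = 1, \ldots, K$ we have
$\beta_{k-1}^* \le \beta_k^*$ and $u_k(z) \le u^*(z)$ for all $z$, and (28) follows from the fact
that $\beta_0^* = 0$, $\beta_K^* \le 1$, and $u^*(z) > 0$ for all $z > 0$. The equalities hold if
$(\beta_1^*, \ldots, \beta_K^*)$ satisfies

$$u^*(z) = u_k(z), \quad \forall z \in (\beta_{k-1}^* \mathrm{SNR}, \beta_k^* \mathrm{SNR}) \tag{29}$$

for any $k = 1, \ldots, K$ and $\beta_K^* = 1$.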

Note that by property 3) of Lemma 1 we have

To proceed, let us define two integers $s$ and $e$ as follows.¹

Definition 2. Let $s$ be the largest index $i \in \{1, \ldots, I\}$ such that
$z_{\pi_{i-1},\pi_i} \le 0$, and let $e$ be the largest index $i \in \{1, \ldots, I\}$ such that
$z_{\pi_{i-1},\pi_i} < \mathrm{SNR}$.

Clearly, we have $1 \le s \le e \le I$. Furthermore, if $s = e$, we have

$$\cdots < z_{\pi_{s-1},\pi_s} \le 0 < \mathrm{SNR} \le z_{\pi_s,\pi_{s+1}} < \cdots \tag{31}$$

and if $s < e$, we have

$$\cdots < z_{\pi_{s-1},\pi_s} \le 0 < z_{\pi_s,\pi_{s+1}} < \cdots < z_{\pi_{e-1},\pi_e} < \mathrm{SNR} \le z_{\pi_e,\pi_{e+1}} < \cdots \tag{32}$$

Using the definition of $s$ and $e$, we have the following explicit characterization of an optimal
power allocation.
Proposition 2 (An optimal power allocation). Assume that $g_K > 0$. Then, an optimal
solution $(\beta_1^*, \ldots, \beta_K^*)$ to the optimization program (8) is given by

$$\beta_k^* = \begin{cases} 0, & \text{for } 1 \le k < \pi_s \\ z_{\pi_i,\pi_{i+1}}/\mathrm{SNR}, & \text{for } \pi_i \le k < \pi_{i+1} \text{ and } i = s, \ldots, e - 1 \\ 1, & \text{for } \pi_e \le k \le K. \end{cases} \tag{33}$$

Proof. Note that we always have $\beta_K^* = 1$. Therefore, in light of the previous discussion, it
is sufficient to show that the choice of $(\beta_1^*, \ldots, \beta_K^*)$ as given by (33)
satisfies (29) for any $k = 1, \ldots, K$. Also note that for the choice of (33), we only need to
consider the cases where $k = \pi_i$ for $i = s, \ldots, e$. Otherwise, we have
$\beta_{k-1}^* = \beta_k^*$, so the open interval $(\beta_{k-1}^* \mathrm{SNR}, \beta_k^* \mathrm{SNR})$
is empty and hence there is nothing to prove.

Let us first assume that $s = e$. In this case, we only need to consider $k = \pi_s$, for which
$\beta_{k-1}^* = 0$ and $\beta_k^* = 1$. By Proposition 1, $u^*(z) = u_{\pi_s}(z)$ for any
$z \in (z_{\pi_{s-1},\pi_s}, z_{\pi_s,\pi_{s+1}}]$. By (31), $z_{\pi_{s-1},\pi_s} \le 0$ and
$z_{\pi_s,\pi_{s+1}} \ge \mathrm{SNR}$. We thus conclude that $u^*(z) = u_{\pi_s}(z)$ for any
$z \in (0, \mathrm{SNR})$.

Next, let us assume that $s < e$. We shall consider the following three cases separately.

Case 1: $k = \pi_s$. In this case, $\beta_{k-1}^* = 0$ and
$\beta_k^* = z_{\pi_s,\pi_{s+1}}/\mathrm{SNR}$. By Proposition 1, $u^*(z) = u_{\pi_s}(z)$ for any
$z \in (z_{\pi_{s-1},\pi_s}, z_{\pi_s,\pi_{s+1}}]$. By (32), $z_{\pi_{s-1},\pi_s} \le 0$. We thus
conclude that $u^*(z) = u_{\pi_s}(z)$ for any $z \in (0, z_{\pi_s,\pi_{s+1}})$.

Case 2: $k = \pi_i$ for some $i \in \{s + 1, \ldots, e - 1\}$. In this case,
$\beta_{k-1}^* = z_{\pi_{i-1},\pi_i}/\mathrm{SNR}$ and $\beta_k^* = z_{\pi_i,\pi_{i+1}}/\mathrm{SNR}$.
By Proposition 1, $u^*(z) = u_{\pi_i}(z)$ for any $z \in (z_{\pi_{i-1},\pi_i}, z_{\pi_i,\pi_{i+1}})$.

Case 3: $k = \pi_e$. In this case, $\beta_{k-1}^* = z_{\pi_{e-1},\pi_e}/\mathrm{SNR}$ and
$\beta_k^* = 1$. By Proposition 1, $u^*(z) = u_{\pi_e}(z)$ for any
$z \in (z_{\pi_{e-1},\pi_e}, z_{\pi_e,\pi_{e+1}}]$. By (32), $z_{\pi_e,\pi_{e+1}} \ge \mathrm{SNR}$.
We thus conclude that $u^*(z) = u_{\pi_e}(z)$ for any $z \in (z_{\pi_{e-1},\pi_e}, \mathrm{SNR})$.

We have thus completed the proof of the proposition. □

¹ The integer $e$ here is not to be confused with the natural number $e$.
Figure 3: An optimal power allocation obtained via the dominating MUF.

Note from (8) that the power allocated to the fading state $g_k$ is given by
$(\beta_k - \beta_{k-1})\mathrm{SNR}$. Thus for the optimal power allocation given by (33), the only
fading states $g_k$ that are assigned nonzero power (i.e., $\beta_k^* - \beta_{k-1}^* > 0$) are
those with $k = \pi_i$ for $i = s, \ldots, e$ (see Fig. 3 for an illustration). This provides an
operational meaning for the integer sequence $\{\pi_1, \ldots, \pi_I\}$ and the integers $s$ and
$e$ defined earlier.
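
The allocation (33) can be sketched numerically as follows. This is an illustrative sketch under stated assumptions, not the authors' code: the function names are hypothetical, the gains are assumed sorted in decreasing order with $g_K > 0$, ties in (15) are detected by exact floating-point equality, and the three cases of (33) are implemented by clipping each crossing point $z_{\pi_i,\pi_{i+1}}$ to $[0, \mathrm{SNR}]$, which gives the same fractions because the crossing points increase along the sequence. The two-state example values are made up for illustration.

```python
import math


def crossing(F, n, k, l):
    """Crossing point z_{k,l} from (14); k < l are 0-based indices."""
    return (F[k] * n[l] - F[l] * n[k]) / (F[l] - F[k])


def optimal_allocation(p, g, snr):
    """Cumulative power fractions (beta_1, ..., beta_K) following (33).

    Assumes g is sorted in decreasing order and g[-1] > 0.
    """
    K = len(g)
    n = [1.0 / gk for gk in g]
    F = [sum(p[:k + 1]) for k in range(K)]
    beta = [1.0] * K  # beta_K = 1 always
    k = 0             # current pi_i, 0-based
    while k < K - 1:
        z = {l: crossing(F, n, k, l) for l in range(k + 1, K)}
        z_min = min(z.values())
        nxt = max(l for l, v in z.items() if v == z_min)  # tie-break of (15)
        frac = min(max(z_min, 0.0), snr) / snr            # clipping = cases of (33)
        for j in range(k, nxt):
            beta[j] = frac
        k = nxt
    return beta, F, n


def expected_rate(beta, F, n, snr):
    """Objective of (8), written with n_k = 1/g_k as in (24)."""
    total, prev = 0.0, 0.0
    for b, Fk, nk in zip(beta, F, n):
        total += Fk * math.log((nk + b * snr) / (nk + prev * snr))
        prev = b
    return total


# Hypothetical example: gains (4, 1), equal probabilities, SNR = 10.
p, g, snr = [0.5, 0.5], [4.0, 1.0], 10.0
beta, F, n = optimal_allocation(p, g, snr)
rate = expected_rate(beta, F, n, snr)
print("beta* =", beta)
print("expected rate (nats) =", rate)
```

For this example the sketch assigns 5% of the power to the layer decodable only in the better state, which agrees with setting the derivative of the objective in (8) to zero for $K = 2$.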

Building on Proposition 2, we have the following characterization of the expected capacity
$C_{exp}(\mathrm{SNR}, F_G, 1)$, which will play a key role in proving the desired worst-case gap
results (5) and (6). The proof mainly involves some straightforward calculations and hence is
deferred to Appendix A to enhance the flow of the paper.

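Proposition 3 (Expected capacity over one-block delay). Assume that $g_K > 0$ and let

$$\Lambda_k := \begin{cases} \dfrac{(n_{\pi_e} + \mathrm{SNR})\,F_{\pi_s}}{n_{\pi_s} F_{\pi_e}}, & \text{for } 1 \le k \le \pi_s \\[2ex] \dfrac{(n_{\pi_e} + \mathrm{SNR})(F_{\pi_m} - F_{\pi_{m-1}})}{(n_{\pi_m} - n_{\pi_{m-1}})\,F_{\pi_e}}, & \text{for } \pi_{m-1} < k \le \pi_m \text{ and } m = s + 1, \ldots, e \\[2ex] 1, & \text{for } \pi_e < k \le K. \end{cases} \tag{34}$$

Then, the expected capacity $C_{exp}(\mathrm{SNR}, F_G, 1)$ can be written as

$$C_{exp}(\mathrm{SNR}, F_G, 1) = \sum_{k=1}^{K} p_k \log \Lambda_k \tag{35}$$
$$= F_{\pi_s} \log\left(\frac{(n_{\pi_e} + \mathrm{SNR})\,F_{\pi_s}}{n_{\pi_s} F_{\pi_e}}\right) + \sum_{m=s+1}^{e} (F_{\pi_m} - F_{\pi_{m-1}}) \log\left(\frac{(n_{\pi_e} + \mathrm{SNR})(F_{\pi_m} - F_{\pi_{m-1}})}{(n_{\pi_m} - n_{\pi_{m-1}})\,F_{\pi_e}}\right). \tag{36}$$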

2.2 Additive Gap 

To prove the worst-case additive gap result (5), we shall prove that
$\sup_{\mathrm{SNR}, F_G} A(\mathrm{SNR}, F_G, 1) \le \log K$ and
$\sup_{\mathrm{SNR}, F_G} A(\mathrm{SNR}, F_G, 1) \ge \log K$ separately.

Proposition 4 (Worst-case additive gap, converse part). For any transmit signal-to-noise
ratio $\mathrm{SNR}$ and any power-gain distribution $F_G(\cdot)$ with $K$ different realizations
of the power gain in each coherent block, we have

$$A(\mathrm{SNR}, F_G, 1) \le \log K. \tag{37}$$

Proof. Let us first prove the desired inequality (37) for the case where $g_K > 0$. In this case,
by Proposition 3 the additive gap $A(\mathrm{SNR}, F_G, 1)$ can be written as

$$A(\mathrm{SNR}, F_G, 1) = \sum_{k=1}^{K} p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right) - \sum_{k=1}^{K} p_k \log \Lambda_k \tag{38}$$
$$= \sum_{k=1}^{K} p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k \Lambda_k}\right). \tag{39}$$

We have the following lemma, whose proof is rather technical and hence is deferred to
Appendix B.

Lemma 2. For any $k = 1, \ldots, K$, we have

$$\frac{n_k + \mathrm{SNR}}{n_k \Lambda_k} \le \frac{1}{p_k}. \tag{40}$$

Substituting (40) into (39), we have

$$A(\mathrm{SNR}, F_G, 1) \le \sum_{k=1}^{K} p_k \log(1/p_k) =: H(F_G) \le \log K \tag{41}$$

where $H(F_G)$ denotes the entropy of the power-gain distribution $F_G(\cdot)$, and the last
inequality follows from the well-known fact that a uniform distribution maximizes the entropy
subject to the cardinality constraint. This proves the desired inequality (37) for the case where
$g_K > 0$.

For the case where $g_K = 0$, let us consider a modified power-gain distribution $F_G'(\cdot)$
with probabilities $p_k' = p_k$ for all $k = 1, \ldots, K$ and $g_k' = g_k$ for all
$k = 1, \ldots, K - 1$. While we have $g_K = 0$ for the original power-gain distribution
$F_G(\cdot)$, we shall let $g_K' = \epsilon$ for some

$$0 < \epsilon < \min_{k = 1, \ldots, K-1} \frac{F_k}{(1 - F_k)\mathrm{SNR} + n_k}. \tag{42}$$

By (14), this will ensure that

$$z_{k,K}' = \frac{F_k/\epsilon - n_k}{1 - F_k} > \mathrm{SNR}, \quad \forall k = 1, \ldots, K - 1. \tag{43}$$

By the definition of $e'$, $z_{\pi_{e'-1}',\pi_{e'}'}' < \mathrm{SNR}$, so we must have
$\pi_{e'}' \ne K$ and hence $\pi_{e'}' < K$. By Proposition 2, this implies that
$\beta_K'^* = \beta_{K-1}'^*$, so the fading state $g_K'$ is assigned zero power for the given
power allocation $(\beta_1'^*, \ldots, \beta_K'^*)$. Hence, the given power allocation
$(\beta_1'^*, \ldots, \beta_K'^*)$ achieves the same expected rate for both power-gain
distributions $F_G(\cdot)$ and $F_G'(\cdot)$. Since $(\beta_1'^*, \ldots, \beta_K'^*)$ is optimal
for the power-gain distribution $F_G'(\cdot)$ but not necessarily so for $F_G(\cdot)$, we have

$$C_{exp}(\mathrm{SNR}, F_G, 1) \ge C_{exp}(\mathrm{SNR}, F_G', 1). \tag{44}$$

On the other hand, improving the realizations of the power gain can only improve the channel
capacity,² so we have

$$C_{erg}(\mathrm{SNR}, F_G) \le C_{erg}(\mathrm{SNR}, F_G'). \tag{45}$$

Combining (44) and (45) gives

$$A(\mathrm{SNR}, F_G, 1) = C_{erg}(\mathrm{SNR}, F_G) - C_{exp}(\mathrm{SNR}, F_G, 1) \tag{46}$$
$$\le C_{erg}(\mathrm{SNR}, F_G') - C_{exp}(\mathrm{SNR}, F_G', 1) \tag{47}$$
$$= A(\mathrm{SNR}, F_G', 1) \tag{48}$$
$$\le \log K \tag{49}$$

where the last inequality follows from the previous case for which $g_K' = \epsilon > 0$. This
completes the proof for the case where $g_K = 0$.

Combining the above two cases completes the proof of Proposition 4. □

Proposition 5 (Worst-case additive gap, forward part). Fix $\mathrm{SNR}$ and $K$, and consider
the power-gain distributions $F_G^{(d)}(\cdot)$ with

$$g_k = \sum_{j=1}^{K-k+1} d^j = \frac{d\,(d^{K-k+1} - 1)}{d - 1} \tag{50}$$

for some $d > \max[(K - 1)/\mathrm{SNR}, 2]$ and uniform probabilities $p_k = 1/K$ for all
$k = 1, \ldots, K$. For this particular parametric family of power-gain distributions, we have

$$\lim_{d \to \infty} A(\mathrm{SNR}, F_G^{(d)}, 1) = \log K. \tag{51}$$

² By the same argument, we also have $C_{exp}(\mathrm{SNR}, F_G, 1) \le C_{exp}(\mathrm{SNR}, F_G', 1)$
and hence $C_{exp}(\mathrm{SNR}, F_G, 1) = C_{exp}(\mathrm{SNR}, F_G', 1)$, even though this direction
of the inequality is not needed in the proof.
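
Proof. For the given $(\mathrm{SNR}, F_G^{(d)})$ pair, it is straightforward to calculate that for
any $1 \le k < l < K$

$$\frac{n_k + z_{k,l}}{n_k + z_{k,l+1}} = \frac{l-k+1}{l-k} \cdot \frac{d\,(d^{K-l} - 1)}{d^{K-l+1} - 1} \cdot \frac{d^{l-k} - 1}{d^{l-k+1} - 1} < \frac{l-k+1}{l-k} \cdot \frac{d^{l-k} - 1}{d^{l-k+1} - 1} \tag{52}$$

where the last inequality follows from the fact that $d > 1$. Since $l - k \ge 1$ and $d > 2$, we
have

$$(l-k+1)(d^{l-k} - 1) - (l-k)(d^{l-k+1} - 1) = [1 - (l-k)(d-1)]\,d^{l-k} - 1 < 0. \tag{53}$$

Substituting (53) into (52) gives

$$\frac{n_k + z_{k,l}}{n_k + z_{k,l+1}} < 1 \tag{54}$$

which immediately implies that $z_{k,l} < z_{k,l+1}$ for any $1 \le k < l < K$. We thus have
$I = K$ and $\pi_i = i$ for all $i = 1, \ldots, K$. Since $d > \max\{(K-1)/\mathrm{SNR}, 2\}$, we
have

$$z_{1,2} = \frac{(d-2)\,d^K + d}{(d-1)\,g_1 g_2} > 0 \tag{55}$$

and

$$z_{K-1,K} = \frac{(K-1)(d + d^2) - Kd}{d\,(d + d^2)} < \frac{K-1}{d} < \mathrm{SNR} \tag{56}$$

so by definition we have $s = 1$ and $e = K$. Thus, by the expression of $\Lambda_k$ from (34) we
have

$$\Lambda_k = \begin{cases} \dfrac{\left(\sum_{j=1}^{K} d^j\right)(1 + \mathrm{SNR} \cdot d)}{K d}, & k = 1 \\[2ex] \dfrac{\left(\sum_{j=1}^{K-k+1} d^j\right)\left(\sum_{j=1}^{K-k+2} d^j\right)(1 + \mathrm{SNR} \cdot d)}{K d^{K-k+3}}, & k = 2, \ldots, K. \end{cases} \tag{57}$$

It follows that

$$\frac{n_1 + \mathrm{SNR}}{n_1 \Lambda_1} = K \cdot \frac{\left(1 + \mathrm{SNR} \cdot \sum_{j=1}^{K} d^j\right) d}{\left(\sum_{j=1}^{K} d^j\right)(1 + \mathrm{SNR} \cdot d)} \tag{58}$$
$$= K \cdot \frac{\mathrm{SNR} \cdot d^{K+1} + O(d^K)}{\mathrm{SNR} \cdot d^{K+1} + O(d^K)} \tag{59}$$
$$\to K \tag{60}$$

in the limit as $d \to \infty$ and

$$\frac{n_k + \mathrm{SNR}}{n_k \Lambda_k} = K \cdot \frac{\left(1 + \mathrm{SNR} \cdot \sum_{j=1}^{K-k+1} d^j\right) d^{K-k+3}}{\left(\sum_{j=1}^{K-k+1} d^j\right)\left(\sum_{j=1}^{K-k+2} d^j\right)(1 + \mathrm{SNR} \cdot d)} \tag{61}$$
$$= K \cdot \frac{\mathrm{SNR} \cdot d^{2(K-k)+4} + O(d^{2(K-k)+3})}{\mathrm{SNR} \cdot d^{2(K-k)+4} + O(d^{2(K-k)+3})} \tag{62}$$
$$\to K \tag{63}$$

in the limit as $d \to \infty$ for any $k = 2, \ldots, K$. By (39), the additive gap

$$A(\mathrm{SNR}, F_G^{(d)}, 1) = \sum_{k=1}^{K} \frac{1}{K} \log\left(\frac{n_k + \mathrm{SNR}}{n_k \Lambda_k}\right) \tag{64}$$
$$\to \sum_{k=1}^{K} \frac{1}{K} \log K \tag{65}$$
$$= \log K \tag{66}$$

in the limit as $d \to \infty$. This completes the proof of Proposition 5. □

Combining Propositions 4 and 5 completes the proof of the desired worst-case additive gap
result (5).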
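
The convergence stated in (51) can be checked numerically. The sketch below is illustrative only: it builds the family (50) with uniform probabilities, computes the one-block expected capacity from the crossing-point recursion (15) and the allocation (33) (implemented by clipping the crossing points to $[0, \mathrm{SNR}]$), and prints the additive gap for growing $d$. The function names and the values $K = 3$, $\mathrm{SNR} = 1$ are assumptions made for illustration, and ties in (15) are detected by exact floating-point equality.

```python
import math


def crossing(F, n, k, l):
    """Crossing point z_{k,l} from (14); k < l are 0-based indices."""
    return (F[k] * n[l] - F[l] * n[k]) / (F[l] - F[k])


def expected_capacity(p, g, snr):
    """One-block expected capacity via (15), (33) and (24); needs g[-1] > 0."""
    K = len(g)
    n = [1.0 / gk for gk in g]
    F = [sum(p[:k + 1]) for k in range(K)]
    beta, k = [1.0] * K, 0
    while k < K - 1:
        z = {l: crossing(F, n, k, l) for l in range(k + 1, K)}
        z_min = min(z.values())
        nxt = max(l for l, v in z.items() if v == z_min)
        frac = min(max(z_min, 0.0), snr) / snr
        for j in range(k, nxt):
            beta[j] = frac
        k = nxt
    total, prev = 0.0, 0.0
    for b, Fk, nk in zip(beta, F, n):
        total += Fk * math.log((nk + b * snr) / (nk + prev * snr))
        prev = b
    return total


def additive_gap(K, snr, d):
    """A(SNR, F_G^(d), 1) for the family (50) with p_k = 1/K."""
    g = [sum(d ** j for j in range(1, K - k0 + 1)) for k0 in range(K)]
    p = [1.0 / K] * K
    ergodic = sum(pk * math.log1p(gk * snr) for pk, gk in zip(p, g))
    return ergodic - expected_capacity(p, g, snr)


K, snr = 3, 1.0
for d in (10.0, 1e3, 1e5):  # each d exceeds max((K - 1) / SNR, 2)
    print(d, additive_gap(K, snr, d), "log K =", math.log(K))
```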

2.3 Multiplicative Gap 

Similar to the additive case, to prove the worst-case multiplicative gap result (6) we shall prove
that $\sup_{\mathrm{SNR}, F_G} M(\mathrm{SNR}, F_G, 1) \le K$ and
$\sup_{\mathrm{SNR}, F_G} M(\mathrm{SNR}, F_G, 1) \ge K$ separately.

Proposition 6 (Worst-case multiplicative gap, converse part). For any transmit signal-to-noise
ratio $\mathrm{SNR}$ and any power-gain distribution $F_G(\cdot)$ with $K$ different realizations
of the power gain in each coherent block, we have

$$M(\mathrm{SNR}, F_G, 1) \le K. \tag{67}$$

Proof. Let us first prove the desired inequality (67) for the case where $g_K > 0$. By definition,
the multiplicative gap $M(\mathrm{SNR}, F_G, 1)$ can be written as

$$M(\mathrm{SNR}, F_G, 1) = \sum_{k=1}^{K} \frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)}. \tag{68}$$

We have the following lemma, whose proof is deferred to Appendix C.

Lemma 3. For any $k = 1, \ldots, K$, we have

$$\frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)} \le 1. \tag{69}$$

Substituting (69) into (68), we have

$$M(\mathrm{SNR}, F_G, 1) \le \sum_{k=1}^{K} 1 = K. \tag{70}$$

This proves the desired inequality (67) for the case where $g_K > 0$.

For the case where $g_K = 0$, we can use the same argument as for the additive case.
More specifically, a modified power-gain distribution $F_G'(\cdot)$ can be found such that
$g_K' > 0$, $C_{exp}(\mathrm{SNR}, F_G', 1) = C_{exp}(\mathrm{SNR}, F_G, 1)$, and
$C_{erg}(\mathrm{SNR}, F_G') \ge C_{erg}(\mathrm{SNR}, F_G)$. Thus, the multiplicative gap

$$M(\mathrm{SNR}, F_G, 1) = \frac{C_{erg}(\mathrm{SNR}, F_G)}{C_{exp}(\mathrm{SNR}, F_G, 1)} \tag{71}$$
$$\le \frac{C_{erg}(\mathrm{SNR}, F_G')}{C_{exp}(\mathrm{SNR}, F_G', 1)} \tag{72}$$
$$= M(\mathrm{SNR}, F_G', 1) \tag{73}$$
$$\le K \tag{74}$$

where the last inequality follows from the previous case for which $g_K' > 0$. This completes the
proof for the case where $g_K = 0$.

Combining the above two cases completes the proof of Proposition 6. □

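Proposition 7 (Worst-case multiplicative gap, forward part). Fix $\mathrm{SNR}$ and $K$, and
consider the power-gain distributions $F_G^{(d)}(\cdot)$ with

$$n_k = \sum_{j=1}^{k} d^j \tag{75}$$

for some $d > 0$ and

$$p_k = \frac{d^k}{\sum_{j=1}^{K} d^j} \tag{76}$$

for all $k = 1, \ldots, K$. For this particular parametric family of power-gain distributions, we
have

$$\lim_{d \to \infty} M(\mathrm{SNR}, F_G^{(d)}, 1) = K. \tag{77}$$

Proof. Note that for the given $(\mathrm{SNR}, F_G^{(d)})$ pair,

$$\frac{n_k}{F_k} = \sum_{j=1}^{K} d^j = n_K \tag{78}$$

so

$$z_{k,l} = \frac{F_k n_l - F_l n_k}{F_l - F_k} = 0, \quad \forall\, 1 \le k < l \le K. \tag{79}$$

We thus have $I = 2$, $\pi_1 = 1$, $\pi_2 = K$, and $s = e = 2$. By the expression of $\Lambda_k$
from (34), we have

$$\Lambda_k = \frac{n_K + \mathrm{SNR}}{n_K}, \quad \forall k = 1, \ldots, K. \tag{80}$$

It follows that the expected capacity

$$C_{exp}(\mathrm{SNR}, F_G^{(d)}, 1) = \sum_{k=1}^{K} p_k \log \Lambda_k = \log\left(\frac{n_K + \mathrm{SNR}}{n_K}\right). \tag{81}$$

We thus have

$$\frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{C_{exp}(\mathrm{SNR}, F_G^{(d)}, 1)} = \frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{\log\left(\frac{n_K + \mathrm{SNR}}{n_K}\right)} \tag{82}$$
$$\ge \frac{p_k n_K}{n_k + \mathrm{SNR}} \tag{83}$$
$$= \frac{d^k}{n_k + \mathrm{SNR}} \tag{84}$$
$$= \frac{d^k}{\sum_{j=1}^{k} d^j + \mathrm{SNR}} \tag{85}$$
$$\to 1 \tag{86}$$

in the limit as $d \to \infty$ for any $k = 1, \ldots, K$, where (83) follows from the well-known
inequalities

$$\frac{x}{1 + x} \le \log(1 + x) \le x, \quad \forall x \ge 0, \tag{87}$$

so we have $\log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right) \ge \frac{\mathrm{SNR}}{n_k + \mathrm{SNR}}$
and $\log\left(\frac{n_K + \mathrm{SNR}}{n_K}\right) \le \frac{\mathrm{SNR}}{n_K}$. On the other hand,
by Lemma 3,

$$\frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{C_{exp}(\mathrm{SNR}, F_G^{(d)}, 1)} \le 1 \tag{88}$$

for any $k = 1, \ldots, K$. Combining (86) and (88) proves that

$$\frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{C_{exp}(\mathrm{SNR}, F_G^{(d)}, 1)} \to 1 \tag{89}$$

in the limit as $d \to \infty$ for all $k = 1, \ldots, K$. By (68), the multiplicative gap

$$M(\mathrm{SNR}, F_G^{(d)}, 1) = \sum_{k=1}^{K} \frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{C_{exp}(\mathrm{SNR}, F_G^{(d)}, 1)} \to \sum_{k=1}^{K} 1 = K \tag{90}$$

in the limit as $d \to \infty$. This completes the proof of Proposition 7. □

Combining Propositions 6 and 7 completes the proof of the desired worst-case multiplicative
gap result (6).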
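
The limit (77) can also be observed numerically. The sketch below evaluates the multiplicative gap for the family (75)-(76), using the closed form (81) for the expected capacity rather than re-deriving the allocation. The function name and the values $K = 3$, $\mathrm{SNR} = 1$ are assumptions made for illustration; `math.log1p` is used because $\mathrm{SNR}/n_K$ becomes very small as $d$ grows.

```python
import math


def multiplicative_gap(K, snr, d):
    """M(SNR, F_G^(d), 1) for the family (75)-(76), with C_exp taken from (81)."""
    n = [sum(d ** j for j in range(1, k + 1)) for k in range(1, K + 1)]  # (75)
    total = n[-1]                                  # sum_{j=1}^{K} d^j
    p = [d ** k / total for k in range(1, K + 1)]  # (76)
    # log((n_k + SNR) / n_k) = log(1 + SNR / n_k)
    ergodic = sum(pk * math.log1p(snr / nk) for pk, nk in zip(p, n))
    expected = math.log1p(snr / n[-1])             # (81)
    return ergodic / expected


K, snr = 3, 1.0
for d in (2.0, 10.0, 1e2, 1e4):
    print(d, multiplicative_gap(K, snr, d))
```

The printed ratios should approach $K$ from below as $d$ grows, in line with Propositions 6 and 7.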

3 Writing on Block-Fading Paper 

Consider the problem of writing on fading paper [15]-[17]:

$$Y[t] = \sqrt{G[t]}\,(X[t] + S[t]) + Z[t] \tag{91}$$

where $\{X[t]\}$ are the channel inputs, which are subject to an average power constraint of $P$;
$\{G[t]\}$ are the power gains of the channel fading, which we assume to be unknown to the
transmitter but known at the receiver; $\{S[t]\}$ and $\{Z[t]\}$ are independent additive white
circularly symmetric complex Gaussian interference and noise with zero means and variances $Q$ and
$N_0$ respectively; and $\{Y[t]\}$ are the channel outputs. The interference signal $\{S[t]\}$ is
assumed to be non-causally known at the transmitter but not at the receiver. Note here that the
instantaneous power gain $G[t]$ applies to both the channel input $X[t]$ and the known
interference $S[t]$, so this model is particularly relevant to the problem of precoding for
multiple-input multiple-output fading broadcast channels.

As for the point-to-point fading channel (1), we are interested in characterizing the worst-case
expected-rate loss for the slow-fading scenario. However, unlike for the point-to-point fading
channel (1), the ergodic capacity of the fading-paper channel (91) is unknown. Below, we first
characterize the ergodic capacity of the fading-paper model (91) to within one bit per channel
use. As we will see, this will also lead to a characterization of the additive expected-capacity
loss to within one bit per channel use for the slow-fading scenario.

3.1 Ergodic Capacity to within One Bit 

Denote by C/^(SNR, INR, Fq) the ergodic capacity of the fading-paper channel ( l9Tj) with 
transmit signal-to-noise ratio SNR := P/Nq, interference-to-noise ratio INR := Q/Nq, and 
power-gain distribution Fq(-). We have the following characterization of C/^ 9 (SNR, INR, Fq) 
to within one bit. 

Theorem 2. For any transmit signal-to-noise ratio SNR, any transmit interference-to-noise 
ratio INR, and any power- gain distribution Fq(-), we have 

C erg (SNR,F G ) - log 2 < C/p(SNR,INR,F g ) < C er9 (SNR, F G ) (92) 

where C erg (SNR, Fq) is the ergodic capacity of the point-to-point fading channel (QJ) of the same 
signal-to-noise ratio and power-gain distribution as the fading-paper channel (I9ip . 

Proof. To show that $C^{fp}_{erg}(\mathrm{SNR}, \mathrm{INR}, F_G) \le C_{erg}(\mathrm{SNR}, F_G)$, let us assume that the interference
signal $\{S[t]\}$ is also known at the receiver. When the receiver knows both the power gains $\{G[t]\}$
and the interference signal $\{S[t]\}$, it can subtract $\{\sqrt{G[t]}\,S[t]\}$ from the received signal $\{Y[t]\}$.
This will lead to an interference-free point-to-point fading channel (1) whose ergodic
capacity is given by $C_{erg}(\mathrm{SNR}, F_G)$. Since giving additional information to the receiver can
only improve the ergodic capacity, we conclude that $C^{fp}_{erg}(\mathrm{SNR}, \mathrm{INR}, F_G) \le C_{erg}(\mathrm{SNR}, F_G)$.
To show that $C^{fp}_{erg}(\mathrm{SNR}, \mathrm{INR}, F_G) \ge C_{erg}(\mathrm{SNR}, F_G) - \log 2$, we shall show that

$$R = E_G\{[\log(G \cdot \mathrm{SNR})]^+\} \tag{93}$$

is an achievable ergodic rate for the fading-paper channel (91), where $x^+ := \max(x, 0)$. Since

$$[\log(G \cdot \mathrm{SNR})]^+ \ge \log(1 + G \cdot \mathrm{SNR}) - \log 2 \tag{94}$$

for every possible realization of $G$, we will have

\begin{align}
C^{fp}_{erg}(\mathrm{SNR}, \mathrm{INR}, F_G) &\ge E_G\{[\log(G \cdot \mathrm{SNR})]^+\} \tag{95}\\
&\ge E_G[\log(1 + G \cdot \mathrm{SNR})] - \log 2 \tag{96}\\
&= C_{erg}(\mathrm{SNR}, F_G) - \log 2. \tag{97}
\end{align}
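The pointwise bound (94) is what drives the one-bit gap, so it can be worth checking numerically. The following is a minimal sketch, not part of the original proof: the function names and the logarithmic grid of test values are illustrative assumptions, and rates are in nats.

```python
import math

def clipped_log_rate(g, snr):
    """[log(g * snr)]^+ in nats; assumes g * snr > 0."""
    return max(math.log(g * snr), 0.0)

def awgn_rate(g, snr):
    """log(1 + g * snr) in nats."""
    return math.log1p(g * snr)

def gap(g, snr):
    """Difference between the two sides of (94), before subtracting log 2."""
    return awgn_rate(g, snr) - clipped_log_rate(g, snr)

# Hypothetical test grid: g * snr from -60 dB to +60 dB in 1 dB steps.
grid = [10 ** (e / 10) for e in range(-60, 61)]
worst_gap = max(gap(x, 1.0) for x in grid)
```

On this grid the largest gap occurs at $G \cdot \mathrm{SNR} = 1$, where it equals $\log 2$; this is why (94) is stated with a non-strict inequality.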

To prove the achievability of the ergodic rate (93), we shall consider a communication
scheme which is motivated by the following thought experiment. Note that with ideal
interleaving, the block-fading channel (91) can be converted to a fast-fading one for which the
power gains $\{G[t]\}$ are independent across different time indices $t$. Now that the channel is
memoryless, by the well-known result of Gel'fand and Pinsker [18] the following ergodic rate
is achievable:

$$R = \max_{(X,U)} \left[ I(U; \sqrt{G}(X + S) + Z \mid G) - I(U; S) \right] \tag{98}$$


where U is an auxiliary variable which must be independent of (G, Z). An optimal choice of 
the input-auxiliary variable pair (X, U) is unknown [TSKTB] . Motivated by the recent work [20] , 
let us consider 

U = X + S (99) 

where X is circularly symmetric complex Gaussian with zero mean and variance P and is 
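independent of S. For this choice of the input-auxiliary variable pair (X, U), we have

\begin{align}
& I(U; \sqrt{G}(X + S) + Z \mid G) - I(U; S) \tag{100}\\
&\quad = E_G[\log(1 + G(\mathrm{SNR} + \mathrm{INR}))] - \log\left(\frac{\mathrm{SNR} + \mathrm{INR}}{\mathrm{SNR}}\right) \tag{101}\\
&\quad > E_G[\log(G(\mathrm{SNR} + \mathrm{INR}))] - \log\left(\frac{\mathrm{SNR} + \mathrm{INR}}{\mathrm{SNR}}\right) \tag{102}\\
&\quad = E_G[\log(G \cdot \mathrm{SNR})]. \tag{103}
\end{align}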

This proves that

$$R = \{E_G[\log(G \cdot \mathrm{SNR})]\}^+ \tag{104}$$

is an achievable ergodic rate for the fading-paper channel (91).
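As a sanity check on the chain (100)-(103), the sketch below evaluates the rate obtained with $U = X + S$ for a discrete power-gain distribution and compares it with the lower bound $E_G[\log(G \cdot \mathrm{SNR})]$. The function names and the three-point distribution in the usage note are hypothetical; rates are in nats.

```python
import math

def rate_with_u_equals_x_plus_s(gains, probs, snr, inr):
    """E_G[log(1 + G(SNR + INR))] - log((SNR + INR)/SNR), as in (101)."""
    mutual_with_output = sum(
        p * math.log1p(g * (snr + inr)) for g, p in zip(gains, probs)
    )
    mutual_with_interference = math.log((snr + inr) / snr)
    return mutual_with_output - mutual_with_interference

def log_lower_bound(gains, probs, snr):
    """E_G[log(G * SNR)], the right-hand side of (103)."""
    return sum(p * math.log(g * snr) for g, p in zip(gains, probs))

def slack(gains, probs, snr, inr):
    """Amount by which the rate in (101) exceeds the bound in (103)."""
    return rate_with_u_equals_x_plus_s(gains, probs, snr, inr) - log_lower_bound(
        gains, probs, snr
    )
```

For example, with gains `[0.2, 1.0, 5.0]`, probabilities `[0.3, 0.4, 0.3]` and `snr = 10`, the slack is positive for every INR and shrinks as INR grows, since it equals $E_G[\log(1 + 1/(G(\mathrm{SNR} + \mathrm{INR})))]$.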

Note that even though the achievable ergodic rate (104) is independent of the transmit
interference-to-noise ratio INR, it is not always within one bit of the interference-free ergodic
capacity $C_{erg}(\mathrm{SNR}, F_G)$. Next, motivated by the secure multicast code construction proposed
in [21], we shall consider a variable-rate coding scheme that takes advantage of the block-fading
feature to boost the achievable ergodic rate from (104) to (93).

Fix $\epsilon > 0$ and let (U, X) be chosen as in (99). Consider communicating a message
$W \in \{1, \ldots, e^{LT_cR}\}$ over L coherent blocks, each of block length $T_c$, which we assume to
be sufficiently large.

Codebook generation. Randomly generate L codebooks, each for one coherent block and
consisting of $e^{T_c(LR + I(U;S) + \epsilon)}$ codewords of length $T_c$. The entries of the codewords are
independently generated according to $P_U$. Randomly partition each codebook into $e^{LT_cR}$ bins, so each
bin contains $e^{T_c(I(U;S) + \epsilon)}$ codewords. See Fig. 4 for an illustration of the codebook structure.



[Figure 4: Codebook 1, Codebook 2, ..., Codebook L, each partitioned into Bin 1, Bin 2, ..., Bin $e^{LT_cR}$.]

Figure 4: The codebook structure for achieving the ergodic rate (93). Each codeword bin in
the codebooks contains $e^{T_c(I(U;S) + \epsilon)}$ codewords.

Encoding. Given the message W and the interference signal $S^{LT_c} := (S[1], \ldots, S[LT_c])$,
the encoder looks into the Wth bin in each codebook $l$ and tries to find a codeword that is
jointly typical with $S_l^{T_c}$, where $S_l^{T_c} := (S[(l-1)T_c + 1], \ldots, S[lT_c])$ represents the segment of
the interference signal $S^{LT_c}$ transmitted over the $l$th coherent block. By assumption, $T_c$ is
sufficiently large so with high probability such a codeword can be found in each codebook [19].
Denote by $U_l^{T_c} := (U[(l-1)T_c + 1], \ldots, U[lT_c])$ the codeword chosen from the $l$th codebook.
The transmit signal $X_l^{T_c} := (X[(l-1)T_c + 1], \ldots, X[lT_c])$ over the $l$th coherent block is given
by $X_l^{T_c} = U_l^{T_c} - S_l^{T_c}$.

Decoding. Let $G_l$ be the realization of the power gain during the $l$th coherent block, and
let

$$\mathcal{L} := \{l : I(U; \sqrt{G_l}(X + S) + Z) - I(U; S) > 0\}. \tag{105}$$

Given the received signal $Y^{LT_c} := (Y[1], \ldots, Y[LT_c])$, the decoder looks for a codeword bin
which contains, for each coherent block $l \in \mathcal{L}$, a codeword that is jointly typical with the
segment of $Y^{LT_c}$ received over the $l$th coherent block. If only one such codeword bin can
be found, the estimated message $\hat{W}$ is given by the index of the codeword bin. Otherwise, a
decoding error is declared.

Performance analysis. Note that averaged over the codeword selections and by the union
bound, the probability that an incorrect bin index is declared by the decoder is no more than

$$\prod_{l \in \mathcal{L}} e^{T_c(I(U;S) + \epsilon)} \cdot e^{-T_c[I(U; \sqrt{G_l}(X+S)+Z) - \epsilon]} = e^{-T_c \sum_{l \in \mathcal{L}} [I(U; \sqrt{G_l}(X+S)+Z) - I(U;S) - 2\epsilon]}. \tag{106}$$

Thus, by the union bound again, the probability of decoding error is no more than

$$e^{T_cLR} \cdot e^{-T_c \sum_{l \in \mathcal{L}} [I(U; \sqrt{G_l}(X+S)+Z) - I(U;S) - 2\epsilon]} = e^{-T_c \left\{\sum_{l \in \mathcal{L}} [I(U; \sqrt{G_l}(X+S)+Z) - I(U;S) - 2\epsilon] - LR\right\}}. \tag{107}$$
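It follows that the transmit message W can be reliably communicated (with exponentially
decaying error probability for sufficiently large $T_c$) as long as

$$\sum_{l \in \mathcal{L}} \left[ I(U; \sqrt{G_l}(X + S) + Z) - I(U; S) - 2\epsilon \right] - LR > 0 \tag{108}$$

or equivalently

$$R < \frac{1}{L} \sum_{l \in \mathcal{L}} \left[ I(U; \sqrt{G_l}(X + S) + Z) - I(U; S) - 2\epsilon \right]. \tag{109}$$

Note that

\begin{align}
& \frac{1}{L} \sum_{l \in \mathcal{L}} \left[ I(U; \sqrt{G_l}(X + S) + Z) - I(U; S) - 2\epsilon \right] \tag{110}\\
&\quad = \frac{1}{L} \sum_{l \in \mathcal{L}} \left[ I(U; \sqrt{G_l}(X + S) + Z) - I(U; S) \right] - \frac{2|\mathcal{L}|}{L}\epsilon \tag{111}\\
&\quad \ge \frac{1}{L} \sum_{l \in \mathcal{L}} \left[ I(U; \sqrt{G_l}(X + S) + Z) - I(U; S) \right] - 2\epsilon \tag{112}\\
&\quad = \frac{1}{L} \sum_{l=1}^{L} \left[ I(U; \sqrt{G_l}(X + S) + Z) - I(U; S) \right]^+ - 2\epsilon \tag{113}\\
&\quad \ge \frac{1}{L} \sum_{l=1}^{L} [\log(G_l \cdot \mathrm{SNR})]^+ - 2\epsilon \tag{114}
\end{align}

where (112) follows from the fact that $|\mathcal{L}| \le L$, (113) follows from the definition of $\mathcal{L}$ from
(105), and (114) follows from (103). Finally, by the weak law of large numbers,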



$$\frac{1}{L} \sum_{l=1}^{L} [\log(G_l \cdot \mathrm{SNR})]^+ \to E_G\{[\log(G \cdot \mathrm{SNR})]^+\} \tag{115}$$

in probability in the limit as $L \to \infty$. We thus conclude that (93) is an achievable ergodic rate
for the fading-paper channel (91).

We have thus completed the proof of Theorem 2. □
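The last step of the proof relies on the per-block average in (115) concentrating around its mean as the number of coherent blocks grows. The sketch below is a small Monte Carlo illustration under assumed inputs (a discrete power-gain distribution and a fixed seed); it simulates only the rate expression, not the binning code itself, and all names are hypothetical.

```python
import math
import random

def target_rate(gains, probs, snr):
    """E_G{[log(G * SNR)]^+}, the rate in (93), in nats."""
    return sum(p * max(math.log(g * snr), 0.0) for g, p in zip(gains, probs))

def ergodic_capacity(gains, probs, snr):
    """E_G[log(1 + G * SNR)] for the interference-free channel, in nats."""
    return sum(p * math.log1p(g * snr) for g, p in zip(gains, probs))

def empirical_block_average(gains, probs, snr, num_blocks, seed=0):
    """(1/L) * sum_l [log(G_l * SNR)]^+ over i.i.d. block gains G_l."""
    rng = random.Random(seed)
    draws = rng.choices(gains, weights=probs, k=num_blocks)
    return sum(max(math.log(g * snr), 0.0) for g in draws) / num_blocks
```

With, say, gains `[0.05, 1.0, 4.0]`, probabilities `[0.3, 0.4, 0.3]` and `snr = 2`, the empirical average over a few hundred thousand blocks lands close to `target_rate`, which in turn lies between `ergodic_capacity - log 2` and `ergodic_capacity`, in line with Theorem 2.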



3.2 Additive Expected-Capacity Loss to within One Bit 

Let $C^{fp}_{exp}(\mathrm{SNR}, \mathrm{INR}, F_G, L)$ be the expected capacity of the fading-paper channel (91) under the
delay constraint of L coherent blocks, and let $A^{fp}(\mathrm{SNR}, \mathrm{INR}, F_G, L) := C^{fp}_{erg}(\mathrm{SNR}, \mathrm{INR}, F_G) -
C^{fp}_{exp}(\mathrm{SNR}, \mathrm{INR}, F_G, L)$ be the additive gap between the ergodic capacity $C^{fp}_{erg}(\mathrm{SNR}, \mathrm{INR}, F_G)$
and the expected capacity $C^{fp}_{exp}(\mathrm{SNR}, \mathrm{INR}, F_G, L)$. We have the following result.

Theorem 3. For any transmit signal-to-noise ratio SNR > 0, any transmit interference-to-
noise ratio INR > 0, and any power-gain distribution $F_G(\cdot)$, we have

$$A(\mathrm{SNR}, F_G, 1) - \log 2 \le A^{fp}(\mathrm{SNR}, \mathrm{INR}, F_G, 1) \le A(\mathrm{SNR}, F_G, 1). \tag{116}$$






Proof. We claim that for any transmit signal-to-noise ratio SNR > 0, any transmit interference-
to-noise ratio INR > 0, and any power-gain distribution $F_G(\cdot)$, we have

$$C^{fp}_{exp}(\mathrm{SNR}, \mathrm{INR}, F_G, 1) = C_{exp}(\mathrm{SNR}, F_G, 1). \tag{117}$$

Then, the desired inequalities in (116) follow immediately from the above claim and Theorem 2.
To prove (117), let us consider the following K-user memoryless Gaussian broadcast channel:

$$Y_k = \sqrt{g_k}(X + S) + Z, \quad k = 1, \ldots, K \tag{118}$$

where X is the channel input which is subject to an average power constraint, S and Z are
independent additive white circularly symmetric complex Gaussian interference and noise, and
$g_k$ and $Y_k$ are the power gain and the channel output of user k, respectively. The interference
S is assumed to be non-causally known at the transmitter but not to the receivers. Similar to
the interference-free (scalar) Gaussian broadcast channel, the broadcast channel (118) is also
(stochastically) degraded. Furthermore, Steinberg [22] showed that through successive Costa
precoding [19] at the transmitter, the capacity region of the broadcast channel (118) is the
same as that of the interference-free Gaussian broadcast channel. We may thus conclude that
the expected capacity $C^{fp}_{exp}(\mathrm{SNR}, \mathrm{INR}, F_G, 1)$ of the fading-paper channel (91) is the same as
the expected capacity $C_{exp}(\mathrm{SNR}, F_G, 1)$ of the interference-free point-to-point fading channel
(1) of the same transmit signal-to-noise ratio SNR and power-gain distribution $F_G(\cdot)$. This
completes the proof of Theorem 3. □

Combining Theorems 1 and 3 immediately leads to the following corollary.

Corollary 4.

$$\log(K/2) \le \sup_{\mathrm{SNR}, \mathrm{INR}, F_G} A^{fp}(\mathrm{SNR}, \mathrm{INR}, F_G, 1) \le \log K \tag{119}$$

where the supremum is over all transmit signal-to-noise ratios SNR > 0, all transmit interference-
to-noise ratios INR > 0, and all power-gain distributions $F_G(\cdot)$ with K different possible
realizations of the power gain in each coherent block.

4 Concluding Remarks 

For delay-limited communication over block-fading channels, the difference between the ergodic 
capacity and the maximum achievable expected rate for coding over a finite number of coherent 
blocks represents a fundamental measure of the penalty incurred by the delay constraint. This 
paper introduced a notion of worst-case expected-capacity loss. Focusing on the slow-fading 
scenario (one-block delay), it was shown that the worst-case additive expected-capacity loss is 
precisely log K nats per channel use and the worst-case multiplicative expected-capacity loss is 
precisely K, where K is the total number of different possible realizations of the power gain in 
each coherent block. Extension to the problem of writing on fading paper was also considered, 
where both the ergodic capacity and the additive expected-capacity loss over one-block delay 
were characterized to within one bit per channel use. 
Many research problems are open along the line of broadcasting over fading channels. Unlike
for the case of one-block delay, the expected capacity of the point-to-point fading channel over
multiple-block delay is unknown except for the case with two-block delay and two different
possible realizations of the power gain in each coherent block [8, 9]. The main difficulty there is
that the capacity region of the parallel Gaussian broadcast channel with a general message set 
configuration remains unknown. With multiple transmit antennas, the expected capacity of the 
point-to-point fading channel is unknown even for one-block delay [1]. Another interesting and
challenging scenario is the mixed-delay setting, where there are multiple messages of different 
delay requirement at the transmitter. Some preliminary results can be found in [23]. With 
known interference at the transmitter, one may also consider the setting where the channel 
fading applies only to the known interference (the fading-dirt problem) [23] or, more generally, 
different channel fading applies to the input signal and the known interference separately. 



A Proof of Proposition 3

Let us first rewrite the expression (24) for the expected capacity $C_{exp}(\mathrm{SNR}, F_G, 1)$ as follows:



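\begin{align}
C_{exp}(\mathrm{SNR}, F_G, 1) &= \sum_{j=1}^{K} F_j \log\left(\frac{n_j + \beta_j^*\mathrm{SNR}}{n_j + \beta_{j-1}^*\mathrm{SNR}}\right) \tag{120}\\
&= \sum_{j=1}^{K} \left(\sum_{k=1}^{j} p_k\right) \log\left(\frac{n_j + \beta_j^*\mathrm{SNR}}{n_j + \beta_{j-1}^*\mathrm{SNR}}\right) \tag{121}\\
&= \sum_{k=1}^{K} p_k \log \Lambda_k \tag{122}
\end{align}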



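where

$$\Lambda_k := \prod_{j=k}^{K} \frac{n_j + \beta_j^*\mathrm{SNR}}{n_j + \beta_{j-1}^*\mathrm{SNR}} \tag{123}$$

and $(\beta_1^*, \ldots, \beta_K^*)$ is given by (33).

To show that $\Lambda_k$ as given by (123) equals the right-hand side of (34), let us first assume
that s = e. For this case, by (33) we have $\beta_j^* = \beta_{j-1}^*$ for every $j \ne \pi_s$. Thus, substituting (33)
into (123) gives

$$\Lambda_k = \begin{cases} \dfrac{n_{\pi_s} + \mathrm{SNR}}{n_{\pi_s}}, & \text{for } 1 \le k \le \pi_s, \\ 1, & \text{for } \pi_s < k \le K. \end{cases}$$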

Next, let us assume that s < e. We shall consider the following three cases separately. 






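Case 1: $k \le \pi_s$. For this case, substituting (33) into (123) gives

\begin{align}
\Lambda_k &= \frac{n_{\pi_s} + z_{\pi_s,\pi_{s+1}}}{n_{\pi_s}} \left( \prod_{i=s+1}^{e-1} \frac{n_{\pi_i} + z_{\pi_i,\pi_{i+1}}}{n_{\pi_i} + z_{\pi_{i-1},\pi_i}} \right) \frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_e} + z_{\pi_{e-1},\pi_e}} \tag{126}\\
&= \frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_s}} \prod_{i=s}^{e-1} \frac{F_{\pi_i}}{F_{\pi_{i+1}}} \tag{127}\\
&= \frac{F_{\pi_s}}{n_{\pi_s}} \cdot \frac{n_{\pi_e} + \mathrm{SNR}}{F_{\pi_e}} \tag{128}
\end{align}

where (127) follows from the fact that the MUFs $u_{\pi_i}(z)$ and $u_{\pi_{i+1}}(z)$ intersect at $z = z_{\pi_i,\pi_{i+1}}$
so we have

$$\frac{n_{\pi_i} + z_{\pi_i,\pi_{i+1}}}{n_{\pi_{i+1}} + z_{\pi_i,\pi_{i+1}}} = \frac{F_{\pi_i}}{F_{\pi_{i+1}}}. \tag{129}$$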
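The intersection property used in Case 1 can be checked with exact arithmetic. The sketch below assumes that the MUF of state k has the form $u_k(z) = F_k/(n_k + z)$, so that two MUFs cross at $z_{k,l} = (F_k n_l - F_l n_k)/(F_l - F_k)$; this form is inferred from the identities used in the appendices rather than quoted from the definition in the main text, and the function names and numerical values are hypothetical.

```python
from fractions import Fraction

def muf(F, n, z):
    """Assumed marginal utility function u(z) = F / (n + z)."""
    return F / (n + z)

def muf_intersection(F_k, n_k, F_l, n_l):
    """Point z where F_k/(n_k + z) equals F_l/(n_l + z); requires F_k != F_l."""
    return (F_k * n_l - F_l * n_k) / (F_l - F_k)

# Hypothetical values with F_k < F_l and n_k < n_l.
F_k, n_k = Fraction(1, 4), Fraction(1)
F_l, n_l = Fraction(3, 4), Fraction(5)
z = muf_intersection(F_k, n_k, F_l, n_l)

# At the crossing point, the three ratios used throughout the appendices agree.
ratio_k = (n_k + z) / F_k
ratio_l = (n_l + z) / F_l
ratio_diff = (n_l - n_k) / (F_l - F_k)
```

Under this assumption the equality of `ratio_k`, `ratio_l` and `ratio_diff` is exactly what lets the products in Cases 1 and 2 telescope.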



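Case 2: $\pi_{m-1} < k \le \pi_m$ for some $m \in \{s + 1, \ldots, e\}$. For this case, substituting (33) into
(123) gives

\begin{align}
\Lambda_k &= \left( \prod_{i=m}^{e-1} \frac{n_{\pi_i} + z_{\pi_i,\pi_{i+1}}}{n_{\pi_i} + z_{\pi_{i-1},\pi_i}} \right) \frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_e} + z_{\pi_{e-1},\pi_e}} \tag{130}\\
&= \frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_m} + z_{\pi_{m-1},\pi_m}} \prod_{i=m}^{e-1} \frac{n_{\pi_i} + z_{\pi_i,\pi_{i+1}}}{n_{\pi_{i+1}} + z_{\pi_i,\pi_{i+1}}} \tag{131}\\
&= \frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_m} + z_{\pi_{m-1},\pi_m}} \prod_{i=m}^{e-1} \frac{F_{\pi_i}}{F_{\pi_{i+1}}} \tag{132}\\
&= \frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_m} + z_{\pi_{m-1},\pi_m}} \cdot \frac{F_{\pi_m}}{F_{\pi_e}} \tag{133}\\
&= \frac{n_{\pi_e} + \mathrm{SNR}}{F_{\pi_e}} \cdot \frac{F_{\pi_m} - F_{\pi_{m-1}}}{n_{\pi_m} - n_{\pi_{m-1}}} \tag{134}
\end{align}

where (132) follows from (129), and (134) follows from the fact that the MUFs $u_{\pi_{m-1}}(z)$ and
$u_{\pi_m}(z)$ intersect at $z = z_{\pi_{m-1},\pi_m}$ so by (14) we have

$$\frac{n_{\pi_m} + z_{\pi_{m-1},\pi_m}}{F_{\pi_m}} = \frac{n_{\pi_m} - n_{\pi_{m-1}}}{F_{\pi_m} - F_{\pi_{m-1}}}. \tag{135}$$
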

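Case 3: $k > \pi_e$. For this case, we have $\beta_j^* = \beta_{j-1}^* = 1$ for any $j \ge k$. Hence, by (33) we
have

$$\Lambda_k = 1. \tag{136}$$

Finally, substituting (34) into (35) gives

\begin{align}
C_{exp}(\mathrm{SNR}, F_G, 1) &= \sum_{k=1}^{\pi_s} p_k \log \Lambda_{\pi_s} + \sum_{m=s+1}^{e} \left( \sum_{k=\pi_{m-1}+1}^{\pi_m} p_k \right) \log \Lambda_{\pi_m} \tag{137}\\
&= F_{\pi_s} \log \Lambda_{\pi_s} + \sum_{m=s+1}^{e} (F_{\pi_m} - F_{\pi_{m-1}}) \log \Lambda_{\pi_m} \tag{138}\\
&= F_{\pi_s} \log\left( \frac{F_{\pi_s}}{n_{\pi_s}} \cdot \frac{n_{\pi_e} + \mathrm{SNR}}{F_{\pi_e}} \right) + \sum_{m=s+1}^{e} (F_{\pi_m} - F_{\pi_{m-1}}) \log\left( \frac{F_{\pi_m} - F_{\pi_{m-1}}}{n_{\pi_m} - n_{\pi_{m-1}}} \cdot \frac{n_{\pi_e} + \mathrm{SNR}}{F_{\pi_e}} \right) \tag{139}\\
&= F_{\pi_s} \log\left( \frac{F_{\pi_s}}{n_{\pi_s}} \right) + \sum_{m=s+1}^{e} (F_{\pi_m} - F_{\pi_{m-1}}) \log\left( \frac{F_{\pi_m} - F_{\pi_{m-1}}}{n_{\pi_m} - n_{\pi_{m-1}}} \right) + F_{\pi_e} \log\left( \frac{n_{\pi_e} + \mathrm{SNR}}{F_{\pi_e}} \right). \tag{140}
\end{align}

This completes the proof of Proposition 3.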



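B Proof of Lemma 2

Let us consider the following three cases separately.

Case 1: $k \le \pi_s$. For such k, by property 3) of Lemma 1 and the definition of s we have

$$z_{k,\pi_s} = \frac{F_k n_{\pi_s} - F_{\pi_s} n_k}{F_{\pi_s} - F_k} \le 0 \tag{141}$$

which implies that

$$\frac{n_{\pi_s}}{F_{\pi_s}} \le \frac{n_k}{F_k}. \tag{142}$$

By the expression of $\Lambda_k$ from (34), for $k \le \pi_s$ we have

\begin{align}
\frac{n_k + \mathrm{SNR}}{n_k \Lambda_k} &= \frac{n_k + \mathrm{SNR}}{n_{\pi_e} + \mathrm{SNR}} \cdot \frac{F_{\pi_e} n_{\pi_s}}{F_{\pi_s} n_k} \tag{143}\\
&\le \frac{n_k + \mathrm{SNR}}{n_{\pi_e} + \mathrm{SNR}} \cdot \frac{F_{\pi_e}}{F_k} \tag{144}\\
&\le \frac{1}{p_k} \tag{145}
\end{align}

where (144) follows from (142), and (145) follows from the fact that $n_k + \mathrm{SNR} \le n_{\pi_e} + \mathrm{SNR}$,
$F_{\pi_e} \le 1$, and $F_k \ge p_k$.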

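Case 2: $\pi_{m-1} < k \le \pi_m$ for some $m \in \{s + 1, \ldots, e\}$. For such k, by (34) we have

$$\frac{n_k + \mathrm{SNR}}{n_k \Lambda_k} = \frac{n_k + \mathrm{SNR}}{n_{\pi_e} + \mathrm{SNR}} \cdot \frac{n_{\pi_m} - n_{\pi_{m-1}}}{F_{\pi_m} - F_{\pi_{m-1}}} \cdot \frac{F_{\pi_e}}{n_k}. \tag{146}$$

By property 1) of Lemma 1 we have $z_{\pi_{m-1},\pi_m} \le z_{\pi_{m-1},k}$, which implies that

$$\frac{n_{\pi_m} - n_{\pi_{m-1}}}{F_{\pi_m} - F_{\pi_{m-1}}} = \frac{n_{\pi_{m-1}} + z_{\pi_{m-1},\pi_m}}{F_{\pi_{m-1}}} \le \frac{n_{\pi_{m-1}} + z_{\pi_{m-1},k}}{F_{\pi_{m-1}}} = \frac{n_k - n_{\pi_{m-1}}}{F_k - F_{\pi_{m-1}}}. \tag{147}$$

Substituting (147) into (146) gives

$$\frac{n_k + \mathrm{SNR}}{n_k \Lambda_k} \le \frac{n_k + \mathrm{SNR}}{n_{\pi_e} + \mathrm{SNR}} \cdot \frac{n_k - n_{\pi_{m-1}}}{F_k - F_{\pi_{m-1}}} \cdot \frac{F_{\pi_e}}{n_k} \le \frac{1}{p_k} \tag{148}$$

where the last inequality follows from the fact that $n_k + \mathrm{SNR} \le n_{\pi_e} + \mathrm{SNR}$, $n_k - n_{\pi_{m-1}} \le n_k$,
$F_{\pi_e} \le 1$, and $F_k - F_{\pi_{m-1}} \ge p_k$.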

Case 3: $k > \pi_e$. For such k, by (34) we have $\Lambda_k = 1$ and hence

$$\frac{n_k + \mathrm{SNR}}{n_k \Lambda_k} = \frac{n_k + \mathrm{SNR}}{n_k}. \tag{149}$$

By property 1) of Lemma 1 and the definition of e, we have $\mathrm{SNR} \le z_{\pi_e,\pi_{e+1}} \le z_{\pi_e,k}$, which
implies that

$$n_k + \mathrm{SNR} \le n_k + z_{\pi_e,k} = \frac{F_k (n_k - n_{\pi_e})}{F_k - F_{\pi_e}}. \tag{150}$$

Substituting (150) into (149) gives

$$\frac{n_k + \mathrm{SNR}}{n_k \Lambda_k} \le \frac{F_k}{F_k - F_{\pi_e}} \cdot \frac{n_k - n_{\pi_e}}{n_k} \le \frac{1}{p_k} \tag{151}$$

where the last inequality follows from the fact that $n_k - n_{\pi_e} \le n_k$, $F_k \le 1$, and $F_k - F_{\pi_e} \ge p_k$.
Combining the above three cases completes the proof of Lemma 2.



C Proof of Lemma 3 



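Let us begin by establishing a simple lower bound on the expected capacity $C_{exp}(\mathrm{SNR}, F_G, 1)$.
Applying the log-sum inequality

$$\sum_i a_i \log\left(\frac{a_i}{b_i}\right) \ge \left(\sum_i a_i\right) \log\left(\frac{\sum_i a_i}{\sum_i b_i}\right) \tag{152}$$

we have

$$F_{\pi_s} \log\left(\frac{F_{\pi_s}}{n_{\pi_s}}\right) + \sum_{m=s+1}^{e} (F_{\pi_m} - F_{\pi_{m-1}}) \log\left(\frac{F_{\pi_m} - F_{\pi_{m-1}}}{n_{\pi_m} - n_{\pi_{m-1}}}\right) \ge F_{\pi_e} \log\left(\frac{F_{\pi_e}}{n_{\pi_e}}\right). \tag{153}$$

Substituting (153) into the expression of $C_{exp}(\mathrm{SNR}, F_G, 1)$ from (36), we have

\begin{align}
C_{exp}(\mathrm{SNR}, F_G, 1) &\ge F_{\pi_e} \log\left(\frac{F_{\pi_e}}{n_{\pi_e}}\right) + F_{\pi_e} \log\left(\frac{n_{\pi_e} + \mathrm{SNR}}{F_{\pi_e}}\right) \tag{154}\\
&= F_{\pi_e} \log\left(\frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_e}}\right). \tag{155}
\end{align}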
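Next we shall prove the desired inequality (69) by considering the following four cases
separately.

Case 1: $k > \pi_e$. For such k, by property 1) of Lemma 1 and the definition of e we have
$z_{\pi_e,k} \ge z_{\pi_e,\pi_{e+1}} \ge \mathrm{SNR}$ and hence

$$\frac{n_{\pi_e} + \mathrm{SNR}}{F_{\pi_e}} \le \frac{n_{\pi_e} + z_{\pi_e,k}}{F_{\pi_e}} = \frac{n_k - n_{\pi_e}}{F_k - F_{\pi_e}}. \tag{156}$$

Thus

\begin{align}
\frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)} &\le \frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{F_{\pi_e} \log\left(\frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_e}}\right)} \tag{157}\\
&\le \frac{p_k}{F_{\pi_e}} \cdot \frac{n_{\pi_e} + \mathrm{SNR}}{n_k} \tag{158}\\
&\le \frac{p_k}{F_k - F_{\pi_e}} \cdot \frac{n_k - n_{\pi_e}}{n_k} \tag{159}\\
&\le 1 \tag{160}
\end{align}

where (157) follows from (155), (158) is due to the well-known inequalities (87) so
$\log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right) \le \frac{\mathrm{SNR}}{n_k}$ and $\log\left(\frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_e}}\right) \ge \frac{\mathrm{SNR}}{n_{\pi_e} + \mathrm{SNR}}$, (159) follows from (156), and (160) is due to the fact that
$n_k - n_{\pi_e} \le n_k$ and $F_k - F_{\pi_e} \ge p_k$.

Case 2: $k = \pi_e$. For such k, by (155) we have

$$\frac{\log\left(\frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_e}}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)} \le \frac{\log\left(\frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_e}}\right)}{F_{\pi_e} \log\left(\frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_e}}\right)} = \frac{1}{F_{\pi_e}} \tag{161}$$

and hence

$$\frac{p_{\pi_e} \log\left(\frac{n_{\pi_e} + \mathrm{SNR}}{n_{\pi_e}}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)} \le \frac{p_{\pi_e}}{F_{\pi_e}} \le 1. \tag{162}$$
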

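Case 3: $k = \pi_m$ for some $m \in \{s, \ldots, e - 1\}$. For this case, we shall show that for any
$m \in \{s, \ldots, e - 1\}$

$$\frac{\log\left(\frac{n_{\pi_m} + \mathrm{SNR}}{n_{\pi_m}}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)} \le \frac{1}{F_{\pi_m}} \tag{163}$$

and hence

$$\frac{p_{\pi_m} \log\left(\frac{n_{\pi_m} + \mathrm{SNR}}{n_{\pi_m}}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)} \le \frac{p_{\pi_m}}{F_{\pi_m}} \le 1. \tag{164}$$

To prove (163), let us define $g(z) := N(z)/D(z)$ where

$$N(z) = \log\left(\frac{n_{\pi_m} + z}{n_{\pi_m}}\right) \tag{165}$$

$$D(z) = F_{\pi_s} \log\left(\frac{F_{\pi_s}}{n_{\pi_s}}\right) + \sum_{i=s+1}^{e} (F_{\pi_i} - F_{\pi_{i-1}}) \log\left(\frac{F_{\pi_i} - F_{\pi_{i-1}}}{n_{\pi_i} - n_{\pi_{i-1}}}\right) + F_{\pi_e} \log\left(\frac{n_{\pi_e} + z}{F_{\pi_e}}\right). \tag{166}$$

By Lemma 1 and the definition of s and e, we have

$$0 < z_{\pi_s,\pi_{s+1}} \le z_{\pi_m,\pi_{m+1}} \le z_{\pi_m,\pi_e} \le z_{\pi_{e-1},\pi_e} < \mathrm{SNR}. \tag{167}$$

By the expression of $C_{exp}(\mathrm{SNR}, F_G, 1)$ from (36), we have

$$\frac{\log\left(\frac{n_{\pi_m} + \mathrm{SNR}}{n_{\pi_m}}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)} = g(\mathrm{SNR}) \le \sup_{z \ge z_{\pi_m,\pi_e}} g(z) \tag{168}$$

where the last inequality follows from the fact that $z_{\pi_m,\pi_e} < \mathrm{SNR}$ as mentioned in (167). Next,
we shall show that $g(z) \le 1/F_{\pi_m}$ at the boundary points $z = z_{\pi_m,\pi_e}$ and $z = \infty$, and for any
local maximum $z^* > z_{\pi_m,\pi_e}$. We may then conclude that

$$\sup_{z \ge z_{\pi_m,\pi_e}} g(z) \le 1/F_{\pi_m}. \tag{169}$$

First, since m < e we have

$$g(\infty) = 1/F_{\pi_e} \le 1/F_{\pi_m}. \tag{170}$$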
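Next, to show that $g(z_{\pi_m,\pi_e}) \le 1/F_{\pi_m}$, let us apply the log-sum inequality (152) to obtain

$$F_{\pi_s} \log\left(\frac{F_{\pi_s}}{n_{\pi_s}}\right) + \sum_{i=s+1}^{m} (F_{\pi_i} - F_{\pi_{i-1}}) \log\left(\frac{F_{\pi_i} - F_{\pi_{i-1}}}{n_{\pi_i} - n_{\pi_{i-1}}}\right) \ge F_{\pi_m} \log\left(\frac{F_{\pi_m}}{n_{\pi_m}}\right) \tag{171}$$

and

$$\sum_{i=m+1}^{e} (F_{\pi_i} - F_{\pi_{i-1}}) \log\left(\frac{F_{\pi_i} - F_{\pi_{i-1}}}{n_{\pi_i} - n_{\pi_{i-1}}}\right) \ge (F_{\pi_e} - F_{\pi_m}) \log\left(\frac{F_{\pi_e} - F_{\pi_m}}{n_{\pi_e} - n_{\pi_m}}\right). \tag{172}$$

Substituting (171) and (172) into (166) gives

\begin{align}
D(z_{\pi_m,\pi_e}) &\ge F_{\pi_m} \log\left(\frac{F_{\pi_m}}{n_{\pi_m}}\right) + (F_{\pi_e} - F_{\pi_m}) \log\left(\frac{F_{\pi_e} - F_{\pi_m}}{n_{\pi_e} - n_{\pi_m}}\right) + F_{\pi_e} \log\left(\frac{n_{\pi_e} + z_{\pi_m,\pi_e}}{F_{\pi_e}}\right) \tag{173}\\
&= F_{\pi_m} \log\left(\frac{F_{\pi_m}}{n_{\pi_m}} \cdot \frac{n_{\pi_e} - n_{\pi_m}}{F_{\pi_e} - F_{\pi_m}}\right) + F_{\pi_e} \log\left(\frac{F_{\pi_e} - F_{\pi_m}}{n_{\pi_e} - n_{\pi_m}} \cdot \frac{n_{\pi_e} + z_{\pi_m,\pi_e}}{F_{\pi_e}}\right) \tag{174}\\
&= F_{\pi_m} \log\left(\frac{n_{\pi_m} + z_{\pi_m,\pi_e}}{n_{\pi_m}}\right) \tag{175}\\
&= F_{\pi_m} N(z_{\pi_m,\pi_e}) \tag{176}
\end{align}

where (175) follows from the fact that the MUFs $u_{\pi_m}(z)$ and $u_{\pi_e}(z)$ intersect at $z = z_{\pi_m,\pi_e}$ so
we have

$$\frac{n_{\pi_m} + z_{\pi_m,\pi_e}}{F_{\pi_m}} = \frac{n_{\pi_e} + z_{\pi_m,\pi_e}}{F_{\pi_e}} = \frac{n_{\pi_e} - n_{\pi_m}}{F_{\pi_e} - F_{\pi_m}}. \tag{177}$$

It follows immediately from (176) that

$$g(z_{\pi_m,\pi_e}) = N(z_{\pi_m,\pi_e})/D(z_{\pi_m,\pi_e}) \le 1/F_{\pi_m}. \tag{178}$$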
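Finally, to show that $g(z^*) \le 1/F_{\pi_m}$ for any local maximum $z^* > z_{\pi_m,\pi_e}$, let us note that
g(z) is continuous and differentiable for all $z > z_{\pi_m,\pi_e}$ so $z^*$ must satisfy

$$\left. \frac{d}{dz} g(z) \right|_{z = z^*} = 0 \tag{179}$$

or equivalently

$$\left. \left[ \frac{dN(z)}{dz} D(z) - \frac{dD(z)}{dz} N(z) \right] \right|_{z = z^*} = 0. \tag{180}$$

We thus have

\begin{align}
g(z^*) = \frac{N(z^*)}{D(z^*)} &= \left. \frac{dN(z)/dz}{dD(z)/dz} \right|_{z = z^*} \tag{181}\\
&= \frac{1}{F_{\pi_e}} \cdot \frac{n_{\pi_e} + z^*}{n_{\pi_m} + z^*} \tag{182}\\
&\le \frac{1}{F_{\pi_e}} \cdot \frac{n_{\pi_e} + z_{\pi_m,\pi_e}}{n_{\pi_m} + z_{\pi_m,\pi_e}} \tag{183}\\
&= \frac{1}{F_{\pi_m}} \tag{184}
\end{align}

where (183) follows from the facts that $n_{\pi_e} > n_{\pi_m}$ so $\frac{n_{\pi_e} + z}{n_{\pi_m} + z}$ is a monotone decreasing function
of z for z > 0 and that $z^* > z_{\pi_m,\pi_e} > 0$, and (184) follows from (177).

Substituting (169) into (168) completes the proof of the desired inequality (163) for Case 3.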

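Case 4: $k < \pi_e$ but $k \ne \pi_i$ for any $i = s, \ldots, e - 1$. For such k, let m be the smallest integer
from $\{s, \ldots, e\}$ such that $k < \pi_m$. Note that

\begin{align}
\frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)} &= p_k \cdot \frac{\log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{\log\left(\frac{n_{\pi_m} + \mathrm{SNR}}{n_{\pi_m}}\right)} \cdot \frac{\log\left(\frac{n_{\pi_m} + \mathrm{SNR}}{n_{\pi_m}}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)} \tag{185}\\
&\le \frac{p_k}{F_{\pi_m}} \cdot \frac{\log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{\log\left(\frac{n_{\pi_m} + \mathrm{SNR}}{n_{\pi_m}}\right)} \tag{186}\\
&= \frac{p_k}{F_{\pi_m}} f(\mathrm{SNR}) \tag{187}
\end{align}

where (186) follows from (161) for m = e and from (163) for $m = s, \ldots, e - 1$, and

$$f(z) := \frac{\log\left(\frac{n_k + z}{n_k}\right)}{\log\left(\frac{n_{\pi_m} + z}{n_{\pi_m}}\right)}. \tag{188}$$

Since $n_k < n_{\pi_m}$, f(z) is a monotone decreasing function for z > 0. By Lemma 1 and the
definition of e, we have

$$z_{k,\pi_m} \le z_{\pi_{m-1},\pi_m} \le z_{\pi_{e-1},\pi_e} < \mathrm{SNR}. \tag{189}$$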
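We shall consider the following two sub-cases separately.

Sub-case 4.1: $z_{k,\pi_m} > 0$. By the monotonicity of f(z) and the fact that $\mathrm{SNR} > z_{k,\pi_m} > 0$
as mentioned in (189), we have

$$f(\mathrm{SNR}) \le f(z_{k,\pi_m}) = \frac{\log\left(\frac{n_k + z_{k,\pi_m}}{n_k}\right)}{\log\left(\frac{n_{\pi_m} + z_{k,\pi_m}}{n_{\pi_m}}\right)} \le \frac{n_{\pi_m} + z_{k,\pi_m}}{n_k} \tag{190}$$

where the last inequality follows from the inequalities (87) so we have $\log\left(\frac{n_k + z_{k,\pi_m}}{n_k}\right) \le \frac{z_{k,\pi_m}}{n_k}$
and $\log\left(\frac{n_{\pi_m} + z_{k,\pi_m}}{n_{\pi_m}}\right) \ge \frac{z_{k,\pi_m}}{n_{\pi_m} + z_{k,\pi_m}}$. By Lemma 1 and the fact that $k < \pi_m$, we have
$z_{\pi_{m-1},\pi_m} \ge z_{k,\pi_m} > 0$ and hence $m \ge s + 1$. Therefore, $k \ne \pi_{m-1}$ and we must have $k > \pi_{m-1}$. Again, by
Lemma 1 we have $z_{k,\pi_m} \le z_{\pi_{m-1},\pi_m} \le z_{\pi_{m-1},k}$ and hence

$$\frac{n_{\pi_m} + z_{k,\pi_m}}{F_{\pi_m}} = \frac{n_k + z_{k,\pi_m}}{F_k} \le \frac{n_k + z_{\pi_{m-1},k}}{F_k} = \frac{n_k - n_{\pi_{m-1}}}{F_k - F_{\pi_{m-1}}}. \tag{191}$$

Substituting (191) into (190) gives

$$f(\mathrm{SNR}) \le \frac{F_{\pi_m}}{F_k - F_{\pi_{m-1}}} \cdot \frac{n_k - n_{\pi_{m-1}}}{n_k} \le \frac{F_{\pi_m}}{F_k - F_{\pi_{m-1}}}. \tag{192}$$

Further substituting (192) into (187) gives

$$\frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)} \le \frac{p_k}{F_k - F_{\pi_{m-1}}} \le 1. \tag{193}$$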

Sub-case 4.2: $z_{k,\pi_m} \le 0$. In this case, $z_{k,\pi_m} = \frac{F_k n_{\pi_m} - F_{\pi_m} n_k}{F_{\pi_m} - F_k} \le 0$ so we have $F_k n_{\pi_m} \le F_{\pi_m} n_k$.
By the monotonicity of f(z) and the fact that SNR > 0, we have

$$f(\mathrm{SNR}) \le \lim_{z \downarrow 0} f(z) = n_{\pi_m}/n_k \le F_{\pi_m}/F_k. \tag{194}$$

Substituting (194) into (187) gives

$$\frac{p_k \log\left(\frac{n_k + \mathrm{SNR}}{n_k}\right)}{C_{exp}(\mathrm{SNR}, F_G, 1)} \le \frac{p_k}{F_k} \le 1. \tag{195}$$

Combining the above two sub-cases completes the proof for Case 4. We have thus completed
the proof of Lemma 3.